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PREFACE TO THE SECOND EDITION 


F ROM its inception, the aim of this book has been to bring the reader 
to an intermediate level of attainment in the main branches of 
theoretical physics from which he may be able to proceed, with the 
help of special literature, to the held of research. Experience shows that 
it is often the first contact with a complex problem that presents the 
major difficulty, and it is here that help i3 most urgently needed. 
Nevertheless, it is necessary to njiaice a certain selection from the many 
topics that seem to offer promise of development. For example, there 
is the question of whether an extensive treatment of the Hamilton- 
Jacobi mechanics, which at the tune of the first edition (1932) formed 
the basis of the old atom theory, is still advisable. The decision to 
retain it is founded on the conviction that only in this way is the com- 
pelling and logical evolution of atomic physics in the form of wave 
mechanics clearly seen. 

Ensuing revisions of the book have made possible, the recasting of 
the presentation of many topics. The subject of nuclear physics re- 
quired, of course, the most extensive changes. In this connexion it is 
felt that a concise survey of the main lines of cosmic ray research is 
now in order, since knowledge in this field has progressed far beyond 
the stage of mere speculation. In this brief presentation much of the 
great mass of experimental material has been omitted, and it is hoped 
that specialists in tins field will not be too critical of what is apparently 
a somewhat oversimplified picture. Geometric optics has been given 
more space and the theory of the top has been modernized on the 
basis of a treatment suggested by Professor Bauersfeld. 

The range of subject matter has been extended by the inclusion of 
selected topics in what might be called “ applied theoretical physics 
To have included these items in the respective chapters dealing with 
these subjects would have interrupted the continuity of development, 
and so they have been grouped in a separate Part of the book. In 
response to numerous suggestions, a Mathematical Addendum on the 
properties of Bessel Functions and Spherical Harmonics has been 
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prepared. This material has been placed intentionally at the end of 
the work rather than in the Mathematical Introduction, in as much 
as it demands a somewhat higher order of computational skill on the 
part of the reader. 

Other changes include: (a) introduction of the M.K.S. system, 
chiefly in the formulas of macroscopic electromagnetism (there is no 
reason to change the familiar numerical relations in atomic physics); 
(6) addition of a considerable number of new exercises; (c) revision of 
the numerical values of physical constants; and (d) extension of the 
list of references for further study. It is hoped that these alterations 
will increase the usefulness of the work in its dual role of text and 
reference book. 

G. J. 


Boston, Mas3., and 
New Brunswick, N.J., 
U.S.A. 

April, 1950, 


PREFACE TO THE THIRD EDITION 

In this revision there are two additional chapters: (1) Phenomeno- 
logical Theory of Superconductivity, (2) Theory of Elastomers; and 
the chapter or. Nuclear Physics is considerably modified. A section on 
Fundamentals of the Matrix Calculus has been added to the first 
chapter and one on The Role of Lattice Defects in Dielectric Crystals 
to Chapter XLI. The Table of Physical Constants has also been revised. 
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THEORETICAL PHYSICS 


INTRODUCTION 

The Purpose and Methods of Theoretical Physics 

Purely experimental physics confines its activities to the exhibition 
of natural phenomena and the careful description of what occurs. 
One might, for example, observe the luminous effects produced by the 
passage of an electric current through a rarefied gas, record what is 
observed, and stop at that point; but to-day a procedure of this kind 
is justly regarded as being unsatisfactory. As soon as we inquire into 
the reasons for the phenomena, we enter the domain of theory, which, 
on the basis of hypotheses admitting of more or less direct test, con- 
nects the observed phenomena and traces them back to single “ pure ” 
phenomena, thus bringing about a logical arrangement of an enormous 
amount of observational material. 

In the given example of electrical discharge in gases, an “ explana- 
tion ” — i.e. a reference back to simple elements- -requires theoretical 
concepts concerning the eonstruction of the atom from electric charges 
and the mechanism of the production of charge-bearing particles. We 
thus find that the explanation of the phenomenon is far from simple, 
and that many effects operate simultaneously. Consequently, from 
the theoretical standpoint, we cannot look upon the phenomenon as 
“ simple ”, however easy it is to produce experimentally. Theory 
alone can decide whether the experimental conditions are pure, i.e. 
simple from a theoretical point of view. In the present example, an 
experiment in which electrons of measurable uniform velocity collide 
with gas atoms may be regarded as simple, in spite of the considerable 
experimental difficulties, since on the basis of such experiments one 
can attempt to give an account of the more complex phenomena of 
the glow discharge. 

While it is true that theory often sets difficult, if not impossible 
tasks for experiment, it does, on the other hand, often lighten the 
work of the experimenter by disclosing cogent relationships which 
make possible the indirect determination of inaccessible quantities, 
and thus render difficult measurements unnecessary. If it is required, 
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for example, to measure the reflecting power of a certain metal, the 
direct determination would require troublesome and withal not very 
accurate photometric measurements. Howevr , if we are familiar with 
the theory of metallic reflection (p. 346), we will measure, instead, two 
angles — the principal angle of incidence and the principal azimuth — 
from which the value of the reflecting power is completely determined. 

Whence does theory obtain its hypotheses concerning the connexions 
between individual physical quantities and events? In the last analysis, 
only from experiment. It is the art of the theorist to deduce from the 
data at hand, connexions of the most far-reaching significance, and 
to draw therefrom conclusions which suggest new experiments. 
There does not exist a theory which draws only upon itself, and which 
has no contact with experimental results. The number of possibilities 
is too large for even the greatest genius to be able to construct, by 
unaided intuition, a world-picture which agrees with experience. 
Because of their neglect of experiment, the scientists (natural philo- 
sophers) of ancient times exhausted themselves in speculations which 
only proved of lasting value where they were connected with obser 
vation, as in Astronomy and Mechanics. A train of thought of the 
nature of the well-known Hegelian dialogue * seems quite absurd to 
us to-day, despite the fact that when contradictions between experi- 
ment and the consequences of a well-founded theory arise, we look for 
a mistake in the experimental procedure as often as wo suspect an 
error in the calculations. In such a case, however, the theory has been 
set up to begin with so as to agree with observational facta. But while 
it is true that the first task of theory is to disclose relationships, an 
equally important part of its work is to formulate these relationships 
mathematically. Mathematics is the outstanding tool of the theorist. 
Its use represents a rationalization of thought, in that the process of 
deriving important conclusions from the initial hypotheses runs in 
the trustworthy channels of rules of calculation learned once for all. 
At the same time, one must not lose sight of the meaning of the com- 
putations. The beginner, especially, is inclined to ignore the physical 
meaning of his work, as contrasted with the purely formal calcu- 
lations. The position occupied by mathematics in the field of theoretical 
physics implies that it is not the problem of the theoretical physicist 
to devise mathematical proofs. He must rely upon the soundness of 
the tool delivered to him by the mathematician. Even in a case where, 
occasionally, he must construct his own tool, he need not trouble 
about mathematical existence theorems, provided the result is suffi- 
ciently evident on physical grounds. Indeed, the strict requirements 
of pure mathematics often contradict the physical facts. For example, 
if density is defined as the limiting value of the ratio of mass to volume 

• “ There oan be only seven planets.” “ Bat this it contrary to fact.” “ Bo muoh 
the worse for the facts." 
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as smaller and smaller volumes are taken, the existence of atoms leads 
to meaningless density fluctuations, the values depending upon whether 
the volume element encloses an atomic nucleus or not. The “ small 
quantities” prescribed by strict mathematics are indispensable in 
physics. Although we reckon with them as if they were “ infinitely 
small ” quantities, physical differentials have a not too small value. 
For such reasons, the ability to use the calculus as a" tool is far more 
important to a physicist than a knowledge of strict concepts such as 
the limit mentioned above. 

Theoretical and experimental physics cannot be separated, and 
a flawless, logically ordered picture of nature is obtained only by the 
application of both methods jointly. A rationally applied theory may 
have direct and immediate practical significance. If it is of impor- 
tance to obtain the highest possible efficiency in a large-scale technical 
process, there is only one way to proceed: we must ascertain the most 
favourable condition^ by exact calculations based on the theory; 
intuition is often deceptive. One need only recall, for example, the 
original pointed form of the airship, which seems to the great majority 
of people, if they are unfamiliar with hydrodynamics, to be the most 
effective form for cutting through the air. 

Thus an exact knowledge of natural lawB is sooner or later trans- 
muted into economic advantages, which, as a sort of “ premium on 
intelligence ” *, are gained by any society which organizes its industrial 
processes on rational principles. 

15 The expression was coined by ischottky in the seme connexion. 



PART I 

Mathematical Ihtbodoction 

In this textbook thb fundamental principles of f be - differential and integral 
oalculus are assumed to be known. Any mathematics required beyond this point 
is collected in four preliminary chapters, in order to avoid later interruption of 
the physioal train of thought by auxiliary mathematical discussions. A branch 
of mathematics of especial value to theoretical phjaics is Vector Anrdytiis, which 
has been developed largely by physicists. Its concepts have immediate psr- 
ceptual meaning, so that many physical laws show their full significance only when 
stated in the language of vectors'. It is therefore quite a mistaken view to regard 
vector analysis merely as a computational shorthand. 



CHAPTER I 
VaCTOE AKA1.V8TS 


i. The Concept of a Vector. 

Besides snc.ii magnitude?? as temperature, mass, and so on, which 
are characterized by the assignment ox a single number, an? 1 which 
are called scalar s, there also occur in physios quantities which arc not 
completely defined by the specification of a single number. The most 
important group of these is characterized by the fact that, besides the 
magnitude, the direction of the quantity must also be given. If, for 
example, we prescribe that an object, 
located by means of its centre of \ z 

gravity P, be given a displacement ! 



Fip. ! 


of 2 cm., then the point f" occupied by the object after displace- 
ment may be anywhere on the surface of a sphere of radius 2 cm. 
with centre at P. The new position of the centre of gravity will, not 
be determined uniquely unless wo specify also the direction of the- 
displacement in some manner. Two displacements of a point may 
be combined into a single equivalent displacement by a simple 
method (fig. 1). Instead of moving the point P to P' in a certain 
direction, and then displacing it in a new direction to P” s it might, 
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Instead of this, we might have moved P at once in the direction PP' 
a distance 

d = o z -j- 6 a -(- c®. 

We see, then, that the displacement of P is uniquely determined by 
specifying the three numbers a, b and c, which are called components 
of the displacement. 

A large number of physical quantities follow the same law of com- 
bination as the displacement of a point, and may, therefore, also be 
defined by three numbers. Such quantities are called vectors. We 
may represent a vector diagrammatically by means of a directed line 
segment (arrow), whose length, on any convenient scale, is numeri- 
cally equal to the magnitude of the physical quantity, and whose 
direction is that of the physical quantity. This geometric represen- 
tation gives vector quantities an important advantage over other 
non-scalar quantities arising in physics, e.g. tensors, for which no 
such simple model exists. 

Two vectors are said to be equal if they are identical with respect 
to both direction and magnitude. Since a pure displacement, i.e. a 
displacement without rotation, does not alter the direction, two vec- 
tors may still be equal, even when they lie in different (but parallel) 
lines. Physical quantities which are altered by pure displacement are 
therefore not immediately representable by vectors. To this class 
belongs, for example, the force acting on a rigid body capable of ro- 
tating about an axis: parallel displacement altera the lever arm of the 
force, and thus changes its effect. It is to be noted, further, that not 
every physical quantity which can be represented uniquely by a 
directed line segment may be treated as a vector; it is also necessary 
to investigate whether this quantity follows the Law of Composition 
obeyed by the displacement of a point. For instance, the rotation of 
a rigid body about an axis may be represented by an arrow whose 
direction is that of the axis of rotation, and whose length in centi- 
metres is numerically equal to the rotation in degrees. Moreover, the 
direction of the arrow may be specified so that the rotation is clock- 
wise when sighting in this direction. But we do not obtain a third 
rotation equivalent to the joint effect of two rotations by combining, 
as above, their representative arrows; the vectorial composition may 
be shown to be valid only for infinitesimal rotations. 

In this book, vectors will be designated by heavy letters (bold-face 
or Clarendon type). 

2. Addition and Subtraction of Vectors; Multiplication of a Vector by 
a Scalar. 

The composition of two displacements of a point, described in 
§ 1 (p. 7), has, as wifi be shown at once, ail the characteristics of a 



I.] 


VECTOR ANALYSIS 


summation. In general, we define, therefore, as the sum of two vectors 
A and B, a vector C obtained by laying off the vector B from the ter- 
minal point of A and then drawing a vector from the initial point of 
A to the end point of B (fig. 3). This Combination is written as a vector 
equation 


C = A H 


(1) 


A vector equation is equivalent to three scalar equations, Bince, as was 
shown in § 1 (p. 8), a vector is determined by specifying its three 
components, and two vectors are equal 
only if the three components are respec- 
tively equal. For if any vector is con- 
structed as the combination of three 
vectors respectively equal to it3 com- 
ponents, two equal vectors can be 
obtained only if their corresponding Fig. f 

components are identical. 

This geometric sum is commutative, like an ordinary sum; i.e. the 
value is independent of the order in which the summation is 
performed. As is seen from fig. 3, we 
obtain the same vector, apart from the 
immaterial parallel displacement, by laying 
off A from the end point of B and drawing 
a line from the initial point of B to the 
terminus of A. Further, a vector sum i3 
associative) i.e. in the case of a sum of 
several vectors, the individual terms may 
be grouped in any arbitrary manner: 

S = f A + B) + C = A + (B + C) 

= B+{A+C) . . (2) 

The proof of this is evident from fig. 4. 

The sum of two vectors having the same 
direction and sense is obtained by addition Fi*. 4 

of their lengths, the direction remaining the 

same. From this follows at once the definition of the product of a 
vector and a pure number. According to the meaning of multiplication, 
mA means the sum of m terms, each one being A, i.e. a vector having 
the same direction, but m times as long. This leads to an important 
representation for a vector: if the direction is specified by a vector 
of length 1, any vector having this direction may be represented by 
multiplying this unit vector e by the magnitude of A, which may be 
designated by the corresponding Roman letter, or by j A |: 

A= Ae— | A|e (3) 

If we think of a vector as being compounded from three vectors having 
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the directions of the axes of a rectangular co-ordinate afvimm, and if 
we denote the lengths of obese vectors ( which we shall call the rect- 
angular components of A}, by A x , A„, A„ and if *, j, k are the unit 
vectors is the directions of the axes, then, the original vector is given, 
in terms of. its components, as 

A = A e i -h A^j -4* AJt. (4) 

The difference of the vectors C and A. is a vector B which when 
added to A gives the vector C. As is seen from fig. 3 (p. §), we obtain 
B by laying off A, with its direction reversed, from the terminus of C, 
and then connecting the initial point of 0 with the end point of A 
From this it follows at once that the meaning of the multiplication of 
a vector by {— I) is merely reversal of direction, for, retaining the 
formal laws of ordinary arithmetic, we may look upon the subtraction 
of the vector A as the addition of a vector — A. 

Ex I. Express, by means of an equation, the fact that three vectors A, B, 
aad C form a closed triangle, all three vectors describing the perimeter in the 
asms direction. 

Ex. 2. Express aaob. of toe following by an equation: 

j a) Two vectors A and B are parallel. 

(6) Three vectors A, B and 0 ate oopi-inar. 

8. The Scalar Product of two Vectors. 

In physios there arias certain coiablnataons of vectors which possess 
the more important properties of product**. We define as the scalar 
product of two vectors, a number (scalar) equal- to the product of the trutgm- 
ludee of the two vectors mvhvplied hy the cosine of the angle included 
between ikm. The scalar product is positive if the included angle is 
acute, and negative if the angle is obtuse. Then, if we agree to indicate 
the scalar product by writing the tito vectors alongside each other * 
the definition of the scalar product reads 

AB — AH cosiAB), (5) 

A cosi.AB) > quid f the * ojemi.u cl she *ecU A upon cb> i cion 
of 8, the scalar product may also be defined as the product of the 
magnitude of sue vector by the projection of the other upon it. A 
simple e.xp?t*aoa folkwe for this projecticn of a vector A upon a 
direction given by the unit vector e: it is merely the scalar product As 

The scalar product has the property of commutativity: 

AB ^ BA (6) 

* If, is soiaotimsE convenient to writ© the scalar product within brackets, in which 
case the brackets used are round-, e.g. the product of the two scalar products AB and 
CD may be written (AB) (CD)- The scalar product itself is written (AB) or (A - B) by 
some authors. 



Lj VECTOR ANALYSIS 11 


for, in the defining equation (5) {p. 10), the order of the factors is 
immaterial. 

The scalar product possesses, further, the most important property 
of an ordinary number product-— that of being distributive with re- 
spect to addition. This is expressed by the equation 

A(B + C + D + ...) = AB + AC 4- AD -f . . . . . (7) 



Fig. s 


It may be seen from fig. 5, that the projection, on the direction 
of A, of the sum of the three vectors 
is the same as the sum of the projections 
of the vectors, whence, on account of the 
wav the scalar product was defined, equa- 
tion (7) is verified. 

However, the scalar product of two 
vectors , differs from the product of two 
numbers in one respect. The latter 
vanishes only if one or both of the 
factors are zero; but the scalar product vanishes also when cos(AB) 
ip, zero, i.e. if the two vectors are perpendicular. Thus, for the 
three unit vectors of a rectangular co-ordinate system, denoted as 
before by #, j, h, ws have 

ij —jk ki~ 0 . ( 8 ) 

The scalar product of a given vector by itself i 
of the vector. Since cos (AB) = 1 in this case, the 
square of the magnitude of the vector: 

A? ~ & A — A % , ...... (9) 

and, reel 


s termed the square 
result is simply the 


jaily, the magnitude may be written 

d = |v'Fj. . . 


(10) 


Thus, for the unit vectors, 

it =jj -■ hh — L (11) 

If the rectangular components of two vectors art-, given, 

A A v i -}- A, t j + A t k and B — B m i -■*- B v j -{- B,k, 

we obtain, an account of the distributive props: ty, by multiplying 
out in the usual manner, 

AB =-= A a B x ii -f A„B,ij + . , 

which, by equations (8) and (11), reduces to 

AB = A„.B 0 4- A..B V 4- A x 3 t . . . . ( 12 ) 

Thus the sealer prod-mi 4 two vectors is equal to ike sum of the -products 
if ‘hi- corresponding component'; 



12 


MATHEMATICAL INTRODUCTION 


[Chap. 


In rectangular co-ordinates, the components are identical with the 
projections upon the axes, so that 

A x — s' A (13) 

It is immediately evident from this result that if C is the vector sum 
of A and B, then each component of C is equal to the sum of the cor- 
responding components; for if equation (1) be multiplied by * there 
results immediately 

O m **A m + B m (U) 

Equation (13) is no longer true for oblique co-ordinates, as may 
be seen from fig. 6, for the two-dimensional case. If we regard as com- 
ponents of A the partial vectors in the 
directions of the axes (from which A may 
be obtained by vector addition), then it 
is evident that these partial vectors are 
not equal to the projections upon the 
axes. But the terminus of A may be 
determined uniquely by specifying the 
projections upon the axes, as well as by 
giving the partial vectors. The former 
; are, therefore, in a sense, also components 

•Aac of A. Because of their different behaviour 

Fig. 6 with respect to linear transformations, 

the components of the first kind, which 
yield A by vector addition, are called the contravariant components, 
and the projections are called the covariant components of the vector. 
This distinction is important in cases where the use of oblique co- 
ordinates is essential, as in the Generalized Theory of Relativity. 

Ex. 3. Give the geometric significance of 

(A + B) J = A a + 2AB + 3 2 . 

Ex. 4. What iB the meaning' of (A -j- B) (A — B) for the case where A* ■= B*T 

Ex. 5. Calculate the angle between the two vectors 

8„ => ooaa n f + eaa&J + cosy,, A { n = 1, 2). 



4. The Vector Product of two Vectors. The Directed Plane Area as 
a Vector. 

Besides the scalar product, there is another equally important 
product-like combination of two vectors, which is itself a vector. The 
vector product of two vectors A and B is defined as a vector P which is 
perpendicular to the plane determined by A and B, and whose magni- 
tude is equal to the area of the parallelogram formed by A and B, 
i.e. equal to AB sin { AB). The sign (sense) of P is so determined that, 
sighting along P, the shortest rotation from A (the first. factor), to- 
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ward B (the second factor), is clockwise (fig. 7). This stipulation 
denies the vector product the property of commutativity possessed by 
the scalar product; for if we reverse the order 
of the factors, then the turning of B toward A 
must be clockwise when looking in the direc- 
tion of P, which means that P is now reversed. 

The vector product is indicated by writing 
the two vectors alongside each other, and 
enclosing them in square brackets* Thus 
p = [AB] == — [BA]. . . (15) 

The magnitude of the product vector is f 

P — AB sin(AB), . . . (15') Fi * T 

The most important property of the vector product, without 
which the use of the term product would not be justified, is that of 
distributivity with respect to addition. That is, 

[A(B+0+D+...)]-[AB]+[A0J+[ADl + ... . (16) 

The proof of this formula is somewhat cumbersome, but not difficult, 
and will not be given here. 

Just as the scalar product of two non-vanishing vectors becomes 
zero when the two vectors are perpendicular, so the vector product 
vanishes if the factors are parallel. The equation [AB] — 0 expresses 
the fact that A and B are parallel, provided that A and B differ from 
zero. In the same way, [AA] = 0. For the unit, vectors along the axes, 

[ir\^W\=\kk-\^0, (17) 

[*73 = k; Uk] = i; . . . (18) 

Using these relationships, the application of the distributive property 
gives the following representation of the vector product in terms of 
the components: 

[AB] = [(A x i + AJ + A,k) (BJ + B v j+ B,k)} 

= (A v B t - A t B v )i+ ( A t B, - A x B,)j + (A„B U ~ A y B m )k. 
This may be written as a determinant: 

j i j £ I 

[AB] = A x A v A A .... (19) 
i B x B v £,| 

The change in sign of the vector product with reversal of the order 
of the factors manifests itself here as the well-known law that the 
interchange of two rows changes the sign of the determinant. 

* Besides the square brackets, the notation A x B is often used. 




14 MATHEMATICAL INTRODUCTION [Chap. 

The specification of the product vector P gives the following in- 
formation concerning the parallelogram formed by the two vectors 
A and B; (I) its position in 3pace (the plane perpendicular to P); (2) the 
.relative position of the sides A and B with respect to one another; 
(■}) the area, numerically equal to the length of P. The form of the 
parallelogram is not given by P. In looking upon the form as being 
unimportant, we are recognizing the fact that every portion of a plane 
may be represented by a vector whose direction and magnitude are 
uniquely determined, according to the sfbove rules, by she orientation, 
direction of description, and area of the figure. 

For the vectors S< representing the races of a closed polyhedron, 
the following theorem holds: If the representative vector ie as- 
signed to each face in such manner that it is directed towards the 
outside of the solid Is ->vety case, then the sum of those vectors 
vanishes: . 

"S S 4 — 0 for closed polyhedra. . . (20) 

This theorem it easily verified for a tetrahedron. If we denote the 
three vectors emanating from the vertex 
(fig. 8) by A, B and C, then the three 
feces passing through this vertex are [AB], 
[BC] and [C’A'j. The base is represented 
by [W — A) (B - A)], If this is expanded, 
and the resulting vector products added to 
the above three, the result is zero. Tow 
any. polyhedron may be “subdivided into 
a number of tetrahedra. The above 
theorem holds icr each of these com- 
ponent tetrahedra, and every surface in- 
serted by the -ufcd “i I ton in the interior cf the solid enters into the 
calculation twice, with oppositely directed normals, so that in forming 
the sum, the extra surfaces formed by cutting the original solid into 
tetrahedra cancel out, leaving only she sum over the external surfaces. 
We can go a step further: if we have an arbitrary closed curved 
surface, we can approximate to it by a polyhedron whose faces are 
portions of the tangent planes. Ia the limit, when the faces dS become 
vanishingly small, the polyhedron and the actual surface coincide, 
and the sum passes sato an integral 

<p as = 0 for dosed surfaces. . . . (21) 

The sign $ indicates integration over a closed region. A closed surface 
of this kind is often termed a “ shell this type of integral, which 
n&f be two# t shell integral ”, will be met with often in vector 
analysis. 
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In spite of the fact that we can assign the product vector Fuaiquely 
to the parallelogram formed by two vectors, there is a certain dif- 
ference between the parallelogram and its representative vector. If, 
namely, the parallelogram be reflected in its own plane, the direction 
of one side with respect to the other remains unaltered, but the sign 
of the product vector changes. This fundamental difference between 
vectors like displacements (polar vectors) and those which represent 
a direction of rotation (axial vectors) comes out dearly when we 
change from a right-handed to a left -banded system of co-ordinates. 
But since such a change is generally unnecessary and superfluous, 
this difference is of no consequence in actual calculation. On the 
ether hand, in the four-dimensional vector analysis of the Theory of 
Relativity, the distinction is of essential importance; in that case v 
polar vector has four components, while an axial vector has six 

Mr. 3. Verify (16), using an expression containing only two terms. 

Em. T, What is the value of [ABj* -f (AB) a ? 

&#. 8, In solid analytical geometry, sill theorems concerning lines and planer 
may be derived with little calculation if we operate with the radio* vadot r «ud 
specify dtrec‘ic*u by unit -rectors, instead of using Ose-ftsiai? oo-ordimUWs. la 
tikis manner give: 

(а) the equation of & plane whose normal direction is that of *• and wooes- 
distance from the origin is p,. 

(б) the distance of a pr int r„ from this plane, 

(?) the eq uation of the plane parsing through three, points r v r* r s „ 


5, Multiple Products. 

(e) Product of a vector with the senior fevdvet of two 
other vectors 

Since a scalar product is an ordinary number, the product- or. a 
vector A with the scalar product BC mean, felt* mmtipttcataon oi A 

with this number, according to the rule given in § 2 (p. 9), i.o. a vector 
having the direction oi A. This is written A (BO) or (BO) A. It is to be 
noticed that (AB)O has »u entirely different meaning from A(BC), 
y» 4 u the former is a vector in the direction of 0, 

(b) Scalar product of a vector with the vector product- of im 
osh.ee vectors 

If P — [AB], then the product OP — CfAB] t» the volume of the 
parallelepiped having th«s three vectors A, B, %o -J 0 as contiguous 
edges, for the magnitude of P is equal to tea area of the parallelogram 
formed by A and B (fig. 9). Thus 

OP = CP cos (OP), 

is the volume of the parallelepiped, since 0 cop (OP) is cqrsal to its 
altitude. One might equalfy well consider the bass of the parallelepiped 
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bo be formed by tbe vectors C and A, and then form the scalar product 
of this quantity with B. This means that the volume can also be 
, represented by the product [CA]B. Because of 
this property, the square braokets are unneces- 
sary, and from now on such a triple product 
will be denoted simply by writing the three 
tors alongside each other. However, the 
_ 1 of the vector •product changes each time the 
cyclic order of the factors is changed. The 
effect of interchanging two factors is seen from 
fig. 9. [BA] is a vector directed downward, 
and hence cos (CP) becomes negative. We may 
set up the following rule for the sign of the triple scalar product: 
The product ABC is positive if the three vectors are relatively 
arranged like the axes of a right-handed co-ordinate system. Thus 

ABC = BCA = CAB = — ACB =* - BAC = — CBA. (22) 

If the components of the three vectors are given, then 
ABC= {AJ+ A„j+ A t k) {(B v G t —B t C y )i + B x C,)j 

+ (B X C V - B y C x )k}. 

This may be written as a determinant: 

\A. A, A, \ 

ABC = \B X B v B J (23) 

\C X C„ G, | 

Equations (22) thus express the well-known, property of determinants 
according to which the interchange of 
two rows causes a change in algebraic 
sign. 

(c) Vector product of a vector with the 
vector product of two other vectors 

If P = [BC], then the vector product 
R = [AP] = [A [BC] ] 

signifies a vector lying in the plane of 
B and C, for P = [BC] is peifpendieular 
to this plane, but R = [AP] is in turn perpendicular to P, and so 
falls in the plane of B and C (fig. 10). 

Since any vector V lying in the plana of B and C may be written 
in the form 



V = aB+ JSC 
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by suitably choosing the scalar multipliers a and 8, it must have this 
form, and it remains to determine the numbers a and j8. To this end, 
we introduce a co-ordinate system in such manner that the x-axis is 
in the direction of C and the y-axis is in the plane of B and C. This 
may be done without restricting the generality of the discussion. We 
then have 

A — A x i - f A v j + A t k, 

B = B x i+ B v j, 

C = CJ. 

The vector product [BC] then assumes the simple form 
[BC] = — B v C x k, 

and [A [BC] ] = — A v B v C x i -f A X B V GJ. 

If the vector A x B x C x i be added and subtracted, there results 
[A [BC] ] = A X G X (BJ + BJ) - {A X B X + A V B»)CJ. 

This is equivalent to the important formula 

[A[BC]]~ B(AC) — C(AB). .... (24) 


The coefficients a and j8 are thus the scalar products AC and — AB. 

With the aid of this formula, complex products may be converted 
to simpler forms. Take, for example, the scalar product of two vector 
products: 

P =-- [AB] [CD], 

We put [AB] — E and obtain, by (22) (p. 16) 

P = E[CD] = — C[ED] = — C[ [AB] D], 
and then, using (24), 

P= C (A(BB) — B( AD) }. 

Hence [AB] [CD] = (AC) (BD) - (BC) (AD). . . (25) 

Ex. 9. Transform [[AB][CD]]. 


6. Differentiation of a Vector with respect to a Scalar; Application to 
the Theory of Space Curves. 


Let a vector V be a continuous function of a continuous scalar 
variable m: , , 

V — v{u). 


If the variable 
amount 


is increased by Am, the vector will change by an 
Av — i>(u+ Am) — v(u), 
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and, in complete analogy with scalar functions, we define the deriva- 
tive dvjdu as the limit 

v'-fZ- Km ”<«+*«> -."M . . m 

du Aw _>o Aw 

Denvatives of a higher order are defined similarly, e g.: 


. ( 27 ) 

du du 2 : \u-*q Au 

If the position vector r of the points on a space curve ha given as 
a function ox the length of arc s, measured 
t- jf from a given initial point, then the magnitude 
■ of A r is identical with A#, and the limit of 
ArjAs is a vector of length i, having the direc- 
P^jff tion of the tangent to the curve (fig. II). This 

/\ lr+dr umf tangent vector is denoted by t. and we have 


The rules for the differentiation of products 
also correspond to those for ordinary scalar functions, aa may be 
seen readily by writing the derivatives as limits. In the case of the 
vector product, however, the order of the factors is important. Wo 
thus have 

d{AB) . dA*, . dB . 5 A /oa . 

du du du du du 


Since neither dvjdu nor v is to vanish") this means that the derivative 
of a vector of constant length is perpendicular to the vector. This is 
also evident geometrically, for if the length, is constant, the end point 
of the vectf is restricted to move on a sphere. If the increment is 
infin itesimal- j.fc is tangent to the sphere, and hence is perpendicular 
to the vector itself. 

This result- may be applied' to the unit tangent t. Since this 
vector is always of unit length, ita derivative must always be 
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perpendicular to i , and so must be a vector in the normal plane 
to the curve. But this derivative, being the vector difference of 


two consecutive tangent vectors, 
must lie in the osculating plane 
formed by the latter, and so its 
direction, is that of the principal 
normal, which direction we de- 
signate by the unit vector n. 
In order to calculate the magni- 
tude of the vector dtjds, we note 
that the curve, in the neighbour- 
hood of two consecutive tangents, 
corresponding to three neighbour- 
ing points, may be replaced by 
the circle of curvature, whose 
centre M is determined by the 
intersection of the perpendicu- 
lars to the two consecutive 
tangents (fig. 12). The angle d<f> 
between these tangents is the 
same as that between the two 
radii of the circle of curvature, 
then, in the limit, 



ds — pdfi. 


(81) 


On tbs other hand, j dt | — dsp, whence 
i 

1 ds f p 

Thus is obtained the important equation 
di __ dV_ n 
ds ds a p 


(32) 


The value of the curvature io obtained from this result by squaring 
(scalar product) and extracting the root: 


k~ 


1 _ {7&r\\ 

p y } 




(33) 


The unit vector b perpendicular to the osculating plane, gives the 
direction, of the binomial. If we stipulate that the three vectors i, n 
and b, in' this order, form a right-handed triad, then 

b = [tii-j. (34> 


For a plana curve, the osculating plane (and therefore the binomal) have 
constant direotioa, so that db/dt vanishes. For a non-plaaa (skew) mure, this 
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derivative may b© looked upon as a measure of the seoond curvature, and its 
magnitude is known as the torsion t. The direction of db/ds is found in the follow- 
ing manner: From the properties of the unit vector, db/ds must be perpendicular 
to b, i.e. parallel to the osculating plane. Further, since b is perpendicular to 
i, bt mm 0, and so 


d(bt)_ 

da 


Since, also, 6 is perpendicular to 


= p (dt/ds), there results 


0 , 


i.e. the vector db/ds is perpendicular to t. Since the former is parallel to the 
osculating plane, it has the direction of the principal normal n. To fix the sign, 
we specify that the torsipn is to be reckoned positive if the rotation of b is clock- 
wise when looking in the direction of t. This gives the vector equation 


drb 

ds 


(36) 


These examples show how much the use of vector analysis simplifies and 
clarifies the material of differential geometry. In what follows, we shall require 
only the principles presented above, and the reader is referred to modern text- 
books on differential geometry for further applications. 

Ex. 10. When is [a^] -=0T 

Ex. 11. An arbitrary surface A has a plane boundary curvj. Form f dh — S 
and show that [6Sj = 0. J 


7. Space Derivatives of st Scalar Quantity. 

If a scalar quantity — the temperature, for example — is given as 
a function of a point in a region, we speak of a scalar field. Let the 
scalar point function have the value u at- a certain point. The change 
in the value of this function, corresponding to a displacement of amount 
ds, ‘will depend on the direction of the displacement, i.e. the derivative 
iujds may have any one of au infinity of values, depending on the 
direction of the displacement ds. It is easily shown, however, that we 
can specify the change in u in any direction, provided we know a cer- 
tain vector obtainable by means of differential operations. In rect- 
angular co-ordinates we have 

ds = dxi dyj + dzk. 

Again, we have du — -f -f 


Evidently, then, du may be regarded as the scalar product of ds with 
a vector 


du.du.9u. 
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which is called the gradient of u, and is written gradu: 

, 3u . , 3 u . , 3u , 

% I&du== d'x t + di J + di k> • • • • ( 3 6) 

that is, du — ds (gradu). ...... (37) 


The meaning of the gradient may be seen in this w*ay: Leaving dis- 
continuous functions out of considera- 
tion, and dealing only with continuous 
point-functions, we can connect all 
points in the region having the same 
value of m by means of surfaces — 
so-called level surfaces (fig. 13). Each 
of these surfaces is characterized by 
the fact that displacements wholly within 
the surface do not alter the value of 
ti. If, then, we let ds Q lie in one of the level surfaces, we have 
ds 0 (gradu) — 0. 

But since neither ds 0 nor gradu is to vanish, this means that the vector 
gradu is perpendicular to the level surface. 

Further, from the definition of the scalar product, it follows that 
the increment du has its greatest value — the magnitude of ds remain- 
ing constant — when ds and gradu are 
in the same direction, or, in other 
words, the vector gradu gives the 
direction of greatest change (“ slope ”) 
of the point-function u — hence the 
name “ gradient According to (37), 
its magnitude is equal to dujds if d s 
is perpendicular to the level surface. 

The change of u in any other direction 
is obtained, according to (37), by multi- 
plying by the cosine of the angle <f> 
between the gradient and the direction 
of displacement. This suggests a simple graphical construction for 
the several values of du, for a ds of fixed magnitude (fig. 14). Two 
lines of length equal to the magnitude of gradu are drawn through 
P 0 and normal to the level surface, one above and one below, and 
the spheres having these two lines as diameters are described. Then 
a secant drawn in any direction gives the magnitude of the change in 
that direction, in relation to the change du in the direction of the 
gradient. It must be remembered that ds is held constant here. 

The derivative of u in the direction of 6' is denoted by dujds, and 
is thus obtained by projection of gradu on the direction s. 


or ad, a 



grad a 
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Numerous examples of th© physical occurrence of the gradient concept will 
be me! ®^ih in many section# of this work. One example may be given now. 
A weather map has marked upon it the lines of equal air pressure (isobars). The 
direction of the wind is then given, apart from the effect of the earth’s rotation, 
by the direction of greatest pressure, drop, whioh is perpendicular to the iso- 
bars; and the strength of the wind is determined by the magnitude of the pres- 
sure drop, 

By introducing the gradient we have obtained a vector field, that 
of gradu, from a scalar field, for grad a is a vector magnitude which 
alters in value from place to place in the region. However, not every 
vector field ipay be obtained Iifom a scalar field in this manner. If a 
' vector field has this property, a result oi 

great importance follows: Imagine a curve 0 
drawn in the vector field (fig. ID). Let this 
curve consist of a succession of line-elements 
ds. The Une integral of a vector t> along the 
curve C is defined as the limit, for ds in- 
finitely small, of the sum of the scalar pro- 
ducts vds, the summation to begin at P g 
and end at P v If now V is the gradient, of a scalar u, this line integral 
between P 9 and P t is independent oi the form of the curve C, for 
the individual magnitudes 

vds = ds (graolw) = du 

represent, according to the definition of the' gradient, the change in 
the function « in moving along a distance ds. The sum of these changes 
is the difference .between the values of « at P x and at P 0 . and this dif- 
ference is independent .of the path by which one passes from P e to P v 

If there are two paths, 0 and C , then reversing the direction of 
motion along"!/' gives with C a closed curve which is described in one 
direction. But since the reversal of O' causes all its line elements ds 
to reverse their sign, and since the value of the fine integral is the 
same on both portions of the curve, the integral along the path 
vanishes. We thus have the important theorem: If a vector fidd is 
representable as a fidd of the gradient of a scalar pomi function, ike value 
of the, line integral of the vector, taken between two points in the region, 
is independent of the path; and the line integral over a closed path 
vanishes. Thus 

f' (grad») ds = «* — ««, . • . m 

and <j) (gradw) ds ~ 0. ....... pi'?/) 

From, the definition (86) of the gradient, a useful rule follows: 
If u is a function of a simple point function v, then, since 
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2is _ L du dv 
dx dv dx* 

gra^M = £* gradw (39) 

Ex. 12. What is the meaning of grad rt (r<=* distance from origin.) 

Ex. 13. Let a soaiar point function depend only on the distance r from the 
otigin. Compute grad /(r), 

| The Concept of Divergence and Gauss's Theorem, 

If an arbitrary vector field sa given, we may, as in the analogous case of § 6 
p. 3.7), wish to determine the change in the fiaM vector v accompanying a die- 
placement is. As may readily he seen, the calculation of thh; change necessitates 
a knowledge of three gradients, viz. those of the rectangular component^., which 
give the amounts of the changes of the components. We thus arrive at a set of 
nine scalars (three vectors). Quantities of this nature will not be met with until 
ds.iar, and so we put this question aside, and taka up another physically significant 
differential operation, which leads from e vector field to a scalar field. 

In order to understand this operation more Nearly, we give the 
field vector a special meaning. Let. the vector v represent, in. direc- 
tion and magnitude, the volume of fluid 
passing, per second, through an area of 
i aq. cm. perpendicular to v (fig. 16). 

The amount of fluid passing through tbs 
element of area AS, each second, is then 
given by the scalar product »AS; for 1 ,6 

the quantity passing through is given 

by v„ the component of the flow perpendicular to the surface element. 
Let us now enclose a space r in the vector field iy means of an arbi- 
trary closed surface. Call the elements of this surface dS and let them 
be so oriented that the normals are directed outward. The amount of 
fluid passing outward through an element dS, in unit time, is therefore 
v dS; inward flow has the negative sign, Lvuirmation of these elemen- 
tary contributions over the entire surface ghee the exeoas amount of 
dfiid passing to the outside each second. This yields the surface in- 
tegral. tfi vdS. If there are »i© sources within the euriaeo, the integral 
meet vanish. If sources exist within, the integral famishes a measure 
of the combined strength of ail source* within the bounded 
volume. The total strength depends ak» an ;hs vohsae, and 
Lie important quantity is the ratio of taa i.fay- integral to 
she volume. In the case of sources conlaxmoasty distributed in 

b;e, by means of unis ratio we -ar »jr k & ih source-strength 
for each element to' volume. For the H m : Uv~ vale* of the 
H'iia, ck& espresdon div v (read :c dxvns^.x o? v J % has been 
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introduced. Divergence is, then, equivalent to source-strength. Thus 
divt) = lim~ ..... (40) 


If the components of to are continuous point functions having con- 
tinuous derivatives, this limit is finite, and independent of the form of 
the volume element. 'This latter fact is somewhat troublesome to prove, 
but it is easy to show that the limit is finite. Thus, let the volume 
element selected be a small sphere of radius p at whose centre the 
field vector has the value v 0 . The departures from this value are then, 
in any case, of the order of magnitude of the radius p. Now tho sur- 
face element on the sphere is p 2 aQ, where dQ. is an elementary solid 
angle. The quantities to be summed in forming the integral are thus 
of the form (v B -j- wp)d8, where w is a variable finite vector, whose 
further properties need not be given. Since by (21) (p. 14) (j) dS~ 0, 
thepartof the integral contributed by the first part, viz. £ v B dti= v 0 j)d8 



Fig. 1? 


divided into volume elements. 
fo 7- eaph of these: 


vanishes, and there remains a con - 
tribution from the second part, of 
the order of p s . But the volume of 
the sphere is 4wp 8 /3, so that the 
result of division by this quantity 
must be finite. 

Let ue now consider a finit3 volume 
which is bounded by a closed surface 
(fig. 17). Let this region be sub- 
The definition of the divergence holds 


daw dr— (f) vdS. 


Let both sides be summed over all the elements of volume. The left 
side becomes the volume integral J div v dr. In the right member, all 
the interior surface elements occur twice (as surfaces of adjacent 
volume elements) with oppositely directed normals. Their contri- 
butions cancel out, since the vector v, on account of its continuity, 
has the same magnitude both times. There remain only the contri- 
butions of elements of the outer surface. This yields Gauss’s Theorem : 

£ vdB—Jdivvdr. ..... (41) 


That is, the surface integral of the vector v, taken over a dosed surface, 
is equal to the volume integral of the divergence of v taken throughout the 
enclosed volume. This is sometimes called the Divergence Theorem. 

If the hydrodynamical case is taken as a basis for demonstrating 
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this theorem, the result appears almost self-evident. The theorem, 
however, is true for any vector field satisfying the conditions given 
above. 

To obtain an expression for the divergence of a vector in terms of 
its rectangular components, we choose as 
volume element a rectangular parallelepiped 
with edges dx, dy and dz. At x, an amount 
of fluid dydz(v x ) x passes through the face 
dydz (fig. 18). Only the normal component 
of v contributes to the fluid passing through dz 
the surface. This fluid enters the volume, 
since the normal has the direction of the [ 
negative x-axis. dy 

At x-f dx, an amount dydz{v x ) r+ ix streams Fi*. is 

out through the opposite face. The normal is 

here directed along the positive x-axis. Expanding (v x ) m + ax by 
Taylor’s Theorem, we have for the contribution of these two surface 
elements 


(v„+ s £d*-,.y y dz. 


Analogous contributions are furnished by the two other pairs of oppo- 
site faces. Thus we have 


f vds - (£' + lj + 3-)***. 


and, after division by the volume dxdydz, 

- m 

dx dy ' dz ' 


divw = 


. . (42) 


Ex. 14. Calculate div r from first principles, and also by the formula. 

Ex. 15. How may the “ gold ” balance of a. country be calculated from the 
amount of imports and exports ? To what extent does this represent an applica- 
tion of Gauss’s Theorem? (“ Gold ” = equivalent of merchandise.) 


9. The Curl of a Vector, and Stokes’s Theorem. 

Another equally important differential operation leads from a 
given vector field not to a scalar field, as in forming the divergence, 
but to another vector field. 

We refer back to the relation (38) (p. 22), according to which the 
line integral of a vector v, taken over a closed curve, vanishes if the 
vector is the gradient of a scalar field. In general, this integral does 
not vanish; and its magnitude measures an important property of 
the field, especially when the integral is taken for infinitesimal sur- 
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face elements, and referred to unit area. Here, too, it may be shown 
that the limiting value is independent of the particular form of the 
boundary, 'and again an estimation of the order of magnitude shows 
that the limit has a meaning when v is a continuous differentiable 
point function. The value of the integral, 
however, depends on the orientation of the 
surface element, as may be shown by a simple 
example. Let the vector v have the same 
direction at all points in the region, but let its 
magnitude, represented by the length of the 
arrows in fig. 19, vary in a direction at right 
angles to the direction of v. If the surface 
element is placed so that its normal is in the direction of V, then 
v is perpendicular to ds along the entire circuit, and the integral 
vanishes. On the other hand, if the normal is perpendicular 
to v, then the net contribution of the edges perpendicular to V is 
zero, but that of the edges parallel to v is not, since the magnitude 
of v was taken to be different at the 
iafeter edges. Tire value of the limit 

L„~~ liis^ vds . , (43) 

is thus a function of the direction of the 
normal to the surface element. We now 
show that we can give the value uf this limit 
for any direction, if we know the value for 
any three non-oopl&nar directions. For 
convenience we take these three directions 
to he the co-ordinate axes! As surface 
element wo choose a email . triangle 
AS --- ABC, whose vertices lie m the axes 
(fig. 20). Instead of passing e round the triangle ABC, we could 
describe, in turn, the triangles OBC ~ AS„ 00 A = A S v and 
GAB *» A Sg. We can calculate the limit 

£„«= ~~ j vd*, &o. . . . (43') 


1 } 



for each. In adding the line integrals along the partial triangles, each 
axis will be covered twice, in opposite directions. There remains only 
the part contributed by the aides AB, BC and CA, so that the re- 
lationship iAS-X.AS.-f £„AS,-f £,AS» . . . (44) 

is satisfied. On the other hand, 'from § 4 (p. 14), 

A B — hAS =* -j- JA3 V -f At ASv , , 


(45) 
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The unit normal vector to AS ia denoted here by n, and is directed 
toward the outside of the tetrahedron, while i, j, k point inward. If 
cos a, cos ft cosy are the direction cosines of the normals n, this 
equation is, in terms of components, 

Aft, — A S cosa, A ft„ = A/S cosft AS, — AS cosy. 

Putting these values in (44) gives 

L n — L x cosa + L y Cos fi L, cosy. , . (46) 

The sum on the right znay be interpreted as the scalar product of the 
unit normal vector n with a vector whose components are L m L v , L a . 
This vector is called “ the curl of v ” (written: curl v).* 

We have therefore 

curl© = iL„ +jL v -f- kL, (47) 

It follows that 

LAS— <£ vds — «(curli/)Aft— (eurlzi)AS. . (48) 

Just as we obtained Gauss’s Theorem, when considering the diver- 
gence, by passing from infinitesimal volume 
elements to finite volumes, so we obtain 
here a new and important relationship by 
considering a portion of a surface bounded 
by a curve. This curve need not be plane, 
but is to be traversed in a given sense 
(fig. 21). Let the surface be subdivided 
into infinitely small elements. The boun- 
dary of each element is to be traversed in the same sense a3 the outer 
curve. Then, for each element, 

£ vds = (cur!©)AS f . 

If we add these equations for the separate elements, the contributions 
from the inner dividing lines cancel out on the left side of the resulting 
equation, since each segment of these lines is traversed twice, in oppo- 
site directions. The left member is then merely the integral along the 
bounding curve, and we obtain the important formula called Stokes's 
Theorem: 

£ vds — jcwclvd& (49) 

The line integral ofjhe vector v t taken over a closed curve C, is equal to 
the surface integral of the curl of v, taken over any surface having C as 
a boundary. 

• The vector is sometimes called “ the rotation of v ”, and written: rot v. 
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There follows at once an. important property of the vector, curl v, 
its divergence is zero. If we put curl v — m, then Gauss's Theorem 
may be applied to a closed surface, and 

^zvdS—J divM> dr — fdxv curl V dr. 

Let this closed surface be divided into two parts by a closed curve 
(fig. 22). Then 

J div curl vdr —J g wdS + J^ wdS—j^ ovxlvd&+ curlvdS. 

@ If, now, we apply Stokes’s Theorem to each of 
-•St the two parts, we must remember tfiat in for ming 
the integral over the entire surface, all the nor- 
mals are directed outward; but in the applica- 
tion of this theorem, the direction of the 
normals is determined by the direction in 
which the curve C is traversed. If we agree 
dS 2 to take dS always toward the outside, then 
Fi*. ax the sign must be reversed in transforming the 

second integral. We thus have 


Since this equation holds for any volume, it follows that 
div curl v — 0 


Further, the vanishing of the line integral of the gradient of a 
scalar point function, taken over a closed 
curve [(38') (p. 22)], may be expressed in 
the following way by applying Stokes’s 
f - Theorem: 

I curl grad«= 0. . . (51) 

It remains to calculate the components of 
the vector curl V from the components of 
the vector v. From (47) (p. 27) and (43') 
(p. 26), we obtain the ^-component of curl v 
by traversing a surface element which lies 
in the yz-plane. Let this element have the form of a square of 
side 2A (cf. fig. 23), with sides parallel to the axes. Only the y-com- 
ponent c* v contributes to the integral along the horizontal sides, 
only the z-component contributes on the vertical sides. If we let the 
subscript 0 refer to the values at the centra of the square, then v % 
along AB has the value 
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Along CD: <».)•+ (£).»+ (^)* 

The contribution of the side AB thus is 

jf* = / + *My = m %) 0 - 2 **(^~) • 

In the same wa y, the side CD yields 

£vds= -j\dy = - 2h(v v ) 0 - 2 h* 

Together, these two sides give 

J vds+ J v'ds = — 4A 2 . 

Similarly, the two vertical sides give 

f’vis+f‘vds~w(^. 

Since the area of the square is Ah 2 , division by this quantity yields * 
/ i x dv v 

and, in like manner, 

, , . dv x do t 

(curl v) y — 

, , . dv v dv x 

(onplt,),= ^-j 5 . 

Hence 

. (dv z dv v \. fdv x 3«A . , /dv v 3«A, 

° m ' v - ( 35 - si ) '+ ( 57 “ 3 ^+ (si - Hi)*- < 47 ) 

On the understanding that the “ product ” of the differentiating 
operator d[dx with a quantity u means dujdx, then the last equation 
may be written as a determinant: 

* J h 

A A 1 

dx 3 y d z 


curl v — 


Ex. 16. Calculate curl r. 

* The subscripts are omitted; the derivatives refer, as before, to tl 


( 52 ) 
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10. The Operator V. 

If we compare the three formulae 


(36) (p. 21): grad w= *g~+i||+ 


(42) (p, 25): div v = 


dy 


dV, 

~dz ' 


we recognize that we can express ail of them symbolically in terms of 
a vector whose components are djdx, djoy , ojdz, on the principle just . 
explained (cf. the last sentence of the preceding section). This sym- 
bolical vector is denoted by V. It was introduced by Sir William 
Hamilton, and is called “ nabla ”, after an ancient Assyrian harp 
whose form V resembles: * 

v -4+'4+4» <“> 

If we multiply a scalar quantity « with this vector operator, we 
obtain 

Vzi— grad«, . , (5U 


'dy~ } 


If we form the scalar produpt of the nabla operator with a vector v, 
we obtain, according to the definition of scalar product, the sum of 
the products of corresponding components: 


_ dv w 
dx 


d Jh>JL 


(55) 


Finally, equation (52) (p. 29) represents the vector product of the 
nabla operator with v: 

[V t>] = cm! v. (56) 


From the formal correspondence of the operations grad, div, and curl 
with multiplication, it follows at once that these operations are dis- 
tributive with respect to addition. That is, 

V (»! + t/ 8 ) = V v x + V v s , &o. . . . (57) 

By starting with the definition of divergence, and generalizing, we 
obtain a definition of the nabla operator which is independent of the 
co-ordinate system: 

* The name " nabla " was used by Heaviside. This operator 5» also frequently 
called “ del ”, and sometimes “ atled ” (the word “ delta " revoked!. 
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div v — V v — lim ~ £ iS v. 


gradw = V« = lim -- j) d&it, 
curly — [V y]=lim^^ [dS»]. 


We shall illustrate the method of proving these formulae by taking 
the case of the gradient. Since the form of the volume element is 
entirely at our disposal, we take ' i 

a small cylinder, the generating — f— _ Level Surface 

lines of which are in the direction J/TIkJ 1 — ~f ~~~7 

of gradw and are therefore nor- / TyX — / 
mal to the level surface (fig. 24). X \!X 'y^dS 

Apart from terms of higher order, / ""I / 

the value of w over the base of ~ ) 

the cylinder is w 0 — A | gradw |; X . j 

at the top of the cylinder, the j 

value is w 0 + A | gradw |. It is Fie . 2+ 

to be remembered that, m form- 
ing the surface integral, the elements dS must always point out- 
ward from the elementary volume. The lateral surface of the cylinder 
does not contribute to the integral, since w has a constant value in each 
transverse section; and to each surface element there corresponds 
another which is equal and opposite. Also, we obtain the contributions 
of the ends of the cylinder by multiplying the area AS by the dif- 
ference in the values of w, and by expressing the direction of dS by 
means of gradw: 

£ d&u — (AS) (2A gradw) = At gradw. 

Dividing by the volume, and passing to the limit, we obtain the third, 
equation of (58). 

These formulae yield important generalizations of Gauss’s Theorem, 
the proof of which is analogous to that given above: 


£ dSu j gradw dr. 
j) [dSv] = f curl v dr. 


Theorems (69) and (60) are also easily proved by writing them out in their 
soalar forms (which are often useful) and using (41). . For (41) can be written 
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/<“ + + -0)iS - /(|f *. §r + |f ) 

»}(«« /I , jB, <7 are scalar functions, and f, w , n are the direction cosines of the 
outward normal to 5. Since A, B, 0 arc independent, it follows that 

judSm. j ~ dr, JmBdS ---» J nC’dS = J—ilx. 

These iormu!® are closely connected with Green’s Theorem (p. 270). They aie 
equivalent to (59), and also lead at once to (60), which involves three equations 
of the type 

/<""’• “ “ /(|y - J?)*'- 

11. Calculation of the Gradient in a Vector Field; Fundamental Prin- 
ciples of Tensor Analysis, 

In § 7 (p. 20) the question “ How does a scalar point function 
change with a displacement dsV’ was treated. This question may be 
extended bo vector fields,, and we may inquire “ How does the vector 
v change with a displacement dsV’ If we start with the components 
of the vector, this question may be referred back to the case of a 
scalar point function. If a vector field be given, then at each point in 
space we have a field of three scalars, viz. the three components. But, 
according to § 7 (p. 21), the change in the components ruay be written 
down an once; 

dv a — ds gr&dUj. ----- ~d-r -f -j- <h, i 

3 dv d v 5 

&o„ — ds gradt’ u ~ — ’ -dec -f- *—< lu -f -vr-dz, > . (61) 

* ® v dx dy * dz i 

dv z = ds grade, =- ^.---dx -j- dy 4~ < ~dz, | 

. dX Off ‘ OZ J 

a,n.d so 

dv — idv x ~\-jdv v -f- kdv, 

= ds grade* .*-}-£$ grade* . / -J- ds grad v x . k. . (6.1 ') 
Merely as an abbreviation we write 

dv — ds yi’aa ■ v, , , . , . (62) 

which is read; “ ds- vector gradient of v” 

In order to be able to calculate dv, three vectors (or nine scalars), 
must be known. We shall see that such forms, consisting of nine 
numbers, have also a physical meaning. ' Srom (61) we see that the 
components of the vector dv ate linear functions of the components 
of the vector ds, the coefficients of these functions being the partial 
derivatives dv x jd as, 8 vjdy, &c. 

Let us now investigate such linear vector functions in detail. As 
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in (61), let the components of a vector m be linear functions of the 
components of a vector v : 

W x = + <*12 «v + «13 V«1 

<», == «a »• 4- «« «, + «»«»[. • • • (63) 

W* = <*31 V* + «33 *, + «S3 M 

In this manner the vector m is placed in one-to-one correspondence 
with the vector v. The scheme of coefficients a,* has thus an independent 
meaning if this correspondence is stick that the passage from v to w is 
independent of the particular co-ordinate system in which the vectors are 
resolved into components. In this case, the coefficients a a are said to be 
the components of a Tensor* 

A vector is a physically determined directed quantity in space, but 
the values of the components depend on the choice of axes. In the 
i, j, k system, ^ = „ ^ = vj> % « vk . 

For a system i',f, k', turned with respect to this system, 
i' — Pn * 4 PisJ + Pl3 j 
f = Pni+PnJ + M\ .... (64) 
k! = B 31 i + Bzd J r P»h, j 

where, since i' , f , k' again form a rectangular system, the fi i!c represent 
the direction cosines of the new axes referred to the old; e.g. p ik are 
the direction cosines of the new axes with respect to the old. Then, 
as may be verified by forming the scalar products i' f «= 0, i'i' = 1, &c., 

Ai'+ZV + = 

PnPn + PuPv 4 P a P* - 0;. 
m general, ££■*&* — 1 for i=j 1 

= 0 for i =# j. f ' * * * '-' b55 
Thus the old axes are expressed in terms of the new: 

* SB Pni'.+ Pnf+0u k > ) 

J - B u i’ + p 22 f + &,*', [ . . . . (66) 

k — ySjj i’ -f- f 4 B’is 4 • j 

The components of v become, in the new co-ordinate system, 
v x . = vi' = ,8 n vi+ p i2 vf+p i3 vk^p n v x + p 12 v v + p i3 v„) 

%•= vf—pnvi + P^vj 4 P zs vk= ^ n v w + A*®*} (67) 

v t - = vk' ^P^vi- f- p 32 vf+ p m v.k^ p ai v x + P^v v + P&v,, j 
i.e. the vector components transform in the same way as the axes. 
We thus have also 0 , Q , 0 » 

’-V = Pu»*' 4- P 21 ®,' + p S i®e . j 
= fta«V 4 &2«V 4- 0 m«V. }-.... (68) 

4- &3*V 4- j 

* Also called in Qsrnian works an Affinor, the term originating in the fact that for 
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How must the tensor components a ik transform, if a change of co- 
ordinate system i3 not to alter the relationship of vu to v ? The follow- 
ing equations must hold: 

w, = -f a li v v 4- a 13 v„ 1 
w v — a zl v x + a^v v 4- a a v„ j- .... (69) 
w x = a sl v x + a S2 v v + 033 «„ J 
and «V = an'** + <hs'«V ’I" a is'«*'. 1 

«V — O21V + «82'<V + <hs' v s, J- . . . . (70) 
«V = a 31 'v + Ogj'v + Oga'v,.. J 

In addition, w*, u\, w, and to*-', ey, «y must be so related that the 
same veotor w is determined by both triads. If we multiply the first 
of the unaocented equations by /8 U > the second by Aa> the third by Aa> 
and add, we obtain, by (67), 

«V = (Ai°u + A 2 ° 2 i + P\s a si}°x + (Ai°i 2 + Aa a 22 + As^A* 

+ (Ai°is + Aa«» + fiv$ a vi) v ?. 171) 


If we express the v x , v V! v, here in terms of the new components, 
according to (68), we obtain wy as a function of ey, v v -, e*.. This value 
of wy must be identical with that defined by (70). Comparing the 
coefficients of ty, e,-, we have 

( hx ~ Ai 2 *n + AiAa a ia + ArAs^hs + As Ai® 2 i + Aa 2 ®32 
+ Aa As<hsH- A 3 Ai“3i + A 3 A 2 ^32 + A3 2<1 3S- • (72) 
Thus the tensor components transform, under rotation of axes, in 
such manner that the new components become linear functions of all 
the old ones, the coefficients being the squares and products of the A*- 
To find the coefficients for the tensor components a <4 ' set «*- — A, 
o« = «!, &c., and use (67) to form the product e/e*' of the vector com- 
ponents, remembering that in expanding the brackets the order of the 
factors Vi and v k mu3t not be changed. Replacing v { 'v k ' by a (k , the co- 
efficients of VfV k yield those of a tli . 

It i3 possible to represent a tensor by means of vectors. For the 
three components of zv in equation (63) (p. 33) may be written as 
the scalar products of the vector v with the three vectors a L , a 2 , a 3 . 
This amounts to regarding the three coefficients of each row as vector 


components: 

w 


a { = a n t + 
— a x v • i -f a 2 v 


a<d + a 6 k, \ 
'j + OsVk. f 


. . (73) 


But, as may be seen in the example of forming a gradient, the three vec- 
tors depend upon the choice of co-ordinate system, for the three vectors 
grade*, grade, and grade* are dependent upon the co-ordinate system. 

Since the components of zv are linear homogeneous functions of 
the components of v, according to (63) (p. 38), this functional cor- 
respondence O satisfies the relationship 


<&(??! + v 2 ) — <b(tT) -f O (u 2 ), 


. - (74) 
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i.e. «I> is distributive with respect to addition. But this the essential 
property of multiplication, since the other properties of products 
have already been given up in the case of the vector product. We 
may thus consider the vector w as resulting from the multiplication 
of the vector v with the tensor <t>. This tensor may be represented, 
according to (73), by three pairs of vectors (dyads): 

a 2 'j + a 3 • A. . . . . . (75) 

The first factors in each term are called the antecedents, the second 
factors, the consequents’, the dot does not indicate multiplication, but 
merely separates the factors. We understand <bt) to mean a vector 
which results from scalar multiplication of the consequents with v, 
and v $ means a vector obtained from v by scalar multiplication of 
the antecedents with V. Thus 

Qv^^iv + Ot-Jv + at-kv, . . (76) 

t><E» = a 1 vt + a 2 v -J -f a z v • k. . . (77) 


The two vectors are, in general, different. The tensor resulting 
from interchanging the antecedents and consequents is called the 
conjugate^ tensor <I> 0 . 


4> 0 = at-Oj +j'Ct i 4- 

‘l>„v — i'a 1 v-}-j'a t v+ k'a a v.> 


. . (78) 


Since it is immaterial whether the scalar a^V is written before or after 
the vector i, comparison with (77) shows that 

a® = (79) 


In the component representation of a tensor, changing to the conjugate 
tensor means the interchange of rows and columns in the array of 
coefficients. 

The dyad, indicated by writing two vectors side by side with a dot 
separating them,* also has the leading property of multiplication, viz. 
distributivity with respect to addition: 

(a x 4- a a ) -t — a x -i+ <v i (80) 


To show this it is sufficient to form the soalar product of the ante- 
cedents with a vector v, in which case 

v(a 1 + a 2 ) • i = woj • i 4- va ? • i, 

since the scalar products are distributive; the result therefore follows. 
We may therefore look upon the union of two vectors to form a dyad 

* Parentheses may, in some cases, be necessary for clarity, e.g. if the factors are 
multinomials. 
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as a new kind of non-commutative product, called the direct or dyadic 
product. This leads to a new interpretation of the expression ds grad • v 
with which we started. If we form the dyadic product of the operator 
nabla with V, there results 


= +j Ty + * Si) ' <’■ * + + *•*> 

= grad v x • i -j- grad 'j -j- gradu, • k. 


(81) 


If this tensor is now multiplied by the displacement ds, we obtain 
exactly (62), viz. 

dsV ’ v — ds grad • v (82) 


By dyadic multiplication of the nabla operator with the field vector v we 
obtain a tensor called the vector gradient of v. Multiplication of the dis- 
placement ds with this tensor as post-factor gives the change of the vector 
v due to the displacement ds. 

Let us again consider tensors in general. A tensor is said to be 
symmetric if 

0 O — O, or, in terms of components, if a ik — a kl , . (83) 
and is antisymmetric (skew-symmetric) if 

0 0 = — O, or, in terms of components, ~ j. . (83') 

By (83), the symmetric tensor has but 6 independent components, 
and, according to (83'), the antisymmetric tensor has only three com- 
ponents, i.e. as many as a vector. As a matter of fact, in three dimen- 
sions, the multiplication of a vector with an antisymmetric tensor 
always may be replaced by forming the vector product with a vector. 
In physics, symmetric tensors are specially important. They may be 
represented by a surface of the second degree. The vectors v are drawn 
as radius vectors r from a fixed origin. By means of the tensor ©, 
a vector tv — r' is allocated to each vectors*. Let the scalar product 
rr* be designated ! >y p. Then 

vv’^zp— r(®r)~ a- a x* + a it xy -f a is xz -f a 21 yx + a n y*+ .... 

Because of the proviso that 0 is to be a symmetric tensor, we 
have 

p = r(0 r) = a u x 2 + a 9i y 2 A- a 33 z* + 2 a 13 xy + 2 a 33 yz + 2a u zx, (84) 
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from which there follows 

+ |f,\ ... (85 ) 

r'^i gradp. j 

This. gives a graphical construction for r'. By (85), r' has the direction 
of the normal to the surface p — const., at the end point P of r. If we 
draw the surface p — 1, then, according to (84), the magnitude of r‘ 
is the reciprocal of the projection of 
r on the direction of the normal at 
P (cf. fig. 25). 

The surface p — 1 is called the 
tensor ellipsoid, although without 
further limitation on the values of 
the coefficients, it might be any 
second degree sutfaoe. The repre- 
sentation by means of a second degree 
surface is seen to be possible, since 
both a surface of this kind and a 
symmetric tensor have six indepen- 
dent coefficients. Further, for every surface of the second degree 
there is one specially appropriate co-ordinate system, viz. that of 
the principal axes. In this system of axes the surface has the 
equation 

P = a t x' 2 + a n y' 2 + a ul z' z .... (84') 
and the tensor, referred to these new i', j\ k' axes, becomes simply 
0 = a,i’ • »' -f a n j' -f + a m k' • k' . . . (75') 

The components of w are thus given by 

mv— W v‘ — = 0, n ,V z : . . (86) 

Ex. 17. Write the Frenet formulas [equns. (32), p. 19. and (S5), p. 20], using 
the vector gradient. 

Ex. 18. Determine the directions of the principal axes of a tensor ellipsoid 
by using T.he relation that r" = \r in these directions. 

12. Calculation of more complicated Vector Differential Expressions 
with the help of the Nabla Operator. 

The operations grad, div, curl and grad may be performed readily, 
in the case of products of scalar or vector point functions, with the 
help of the V-operator. It is only necessary to remember that V is a 
sign of differentiation as well as a vector. Everything standing to the 
right of such a sign is subject to the process of differentiation. In 
ordinary differentiation, the quantities to be held constant may always 
be placed to the left of the differentiating symbol. But sine® the 
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vector product, and especially the dyadic product, of two vectors are 
not commutative — though of course in a vector product we can always 
change the order by changing the sign — we shall indicate by means 
of the subscript c the constancy of a factor on the right of the sign 
of differentiation. This will permit us to leave constants on the right 
side of the sign V. By following a limiting process entirely analogous 
to that for the ordinary differentiation of products, it may be shown' 
that, in general, a space derivative (grad, div, curl, grad) of a pro- 
duct is equal to the sum of the corresponding expressions for products 
in which only one factor is variable. Thus we obtain the following 
practical rule for the space differentiation of products: Using the 
V -symbol, write the differential expression as the sum of derivatives in 
which only one factor is subject to differentiation. Transform time by 
the rules of vector algebra in such a way that all qiMntities not to be dif- 
ferentiated hre on the left of the sign of differentiation, and interpret 
the resulting expressions in terms of the special symbols, grad, 
div, <6 c. 

This method is easily understood in the case of the more important 
and frequently occurring formulae. These expressions will be computed 
below. 

(a) Products of scalar point functions 

1. grad wow — V«o,w,+ Vu,vw, -f Vu,v,w 

— vw grad u-\- wu grad «+ uv grad w. . . (87) 

(6) Product of a scalar point function with a vectorial 
point function 

2. div uv — Vuv,+ V u,v = V grad u-j~ « dive. . (88) 

3. curluw = [V uv,] + [V u;a] = [gradu, o]-f « curl v. (89) 

4. grad ■uv—'V- uv,+ V • u,v — grad u-v + u grad • v. (90) 

(c) Products of vectorial point functions 
gr&dvzv— V * vw,+ V * v 0 w. 

Now [i> curltn] == [»[Vw]] = V* v,w — * w 

= grad v,m — v grad • w; 

thus 

grad vrv—[v curl«/]-f v grad -w + [vo curl v] + w grad- v. (91) 
div[vm]= V[z>ra>.]-|- V[z;,,iy] = mj[Vo]- »[Vte] 

= vo cuilt> — t; curlttJ. 


(92) 
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curt[ow3 = [V[fM>.]]+ [Vfw.w]] 

= w grad • v — vu div v — v grad * zv -}- v divze». [95) 


id) 


div gradw = V Vw 


Second derivatives 

3m 3 3« 3 ?w 

dx dx dy dy 3 z 3 z 


3®m 3 2 « 

3 ^+ 3*5 + 


3®« 

3P 


= Au. . , . (94) 


Toe scalar product of the V-operator with itself is therefore equal to 
3 a 3 a 3 2 

the Laplacian operator g-| + g~ 2 "I" denoted by A. Note, how- 
ever that the rule for products stated in § 12, p. 38, does n os apply to 
A, as the following example shows: 

div grad wv — div (« grad® -j~ v gradw) — u Aw -J- ® A U 
-}-'2 gradw grad®. 


Again, we have curl grad u= [Ww] — 0. .... (49) 

This result, already obtained in § 9 (p. 28), follows purely formally 
here from the rule of vector algebra which states that a product of the 
form [AAa] vanishes. Similar remarks hold for the relationship 
div curl v— V[Vt>]= 0, .... (48) 


also obtained in § 9 (p. 28). 

On the other hand, there is no simpler vector expression for 
grad divp. In rectangular co-ordinates, this expression signifies the 
vector 


. 3 


. 3 

'ar 




d*v v 8 2® A 
3x3 y dzdz ) 


+ *(**£ + 

' • \3x3y ^ dy* ' 




dydz) + k (dxdl + dydz + 3 J)' (96) 


With the help of the expression for grad divrr the expression curl 
curl V may be transformed in a manner which finds important appli- 
cation in electrodynamics: 

curl curlt>=[V[Vo]]=V* Vu — VV- r>=grad divt>— VV-». (96) 

Here we have the scalar product of V with itself standing before a 
vector. If this vector is written in terms of its components, the com- 
plete expression curl curl v may be calculated with little difficulty; 
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the computation shows that the result is grad div v minus a vector 
with components as follows: 


*+?*+? 
2 ^ dy^ c 


c 2 v x , a 2 «A , ./d e v v , 

7V ’ v *(a®* + 9j^ + 

+ A (S'+ 5? + S') “ + **».“ *»• <S>7) 


Thus the rule VV = A, which held for simple point functions, is true 
here also. Hence we have. the important relation 

curl curl v = grad div® — Av. . . . (98) 


Ex. 19. . Calculate the following surface integrals for constant vector i: 
(a) & {ir ) . d&i [b)(f)r,idS. 

Ex. 20. Calculate A -. 


Ex. 21. Convert the surface integral 

£ (it grad v — v grad tt)dS 

into a volume integral. In particular apply the result to v ±= 1/r. It should bo 
remembered that the point O is to be excluded from the region of integration, 
on account of the discontinuity of 1/r. Thus the surface integral must also ex- 
tend over the small sphere enclosing O and having its normal directed toward 
this point. 


13.* Differential Vector Operations in Curvilinear Orthogonal Co- 
ordinates. 

In many cases occurring in practice, it is more expedient to resolve 
a vector not in Cartesian co-ordinates, as done exclusively thus far, 
but to use other' parameters which are often inherent in the problem 
itself. For example, in studying the propagation of electromagnetic 
waves over the earth’s surface, spherical surface co-ordinates (latitude 
and longitude) are indicated. In space, a point is located by means 
of three parameters w, v, w. The rectangular co-ordinates are then 
functions of these parameters: 

x = ^{u, v, w), y = <£ 2 (w, v, w), z = <£ 3 (w, v, w). . (99) 

If one co-ordinate, say w, is held constant, (99) will represent the 
parametric equation of the surface u -- const. In the same way we get 
another surface for v = const., and through every point in space there 

Starred sections may be omitted without destroying the continuity. 
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will be three surfaces corresponding to the three co-ordinate planes. 
We restrict our considerations to those systems where the three co- 
ordinate surfaces form an orthogonal set, i.e. are normal to each other in 
pairs (Jig. 26), as this type of curvilinear co-ordinate system is almost 
exclusively used. In the case of spherical space co-ordinates it is 
immediately evident that the three surfaces — sphere, radial cone and 
meridian plane — are mutually perpendicular. Elliptical co-ordinates 
arc abc orthogonal. Their surfaces consist of confocal ellipsoids of 
rotation, hyperboloids of either one. or two sheets, and planes con- 
taining one or the other principal axis of the conics. If the co-ordinate 
surfaces are orthogonal, so will be the curves of intersection, called 
co-ordinate lines. The tangents drawn at an intersection of three 



surfaces will be characterized by the unit vectors u, v, m. correspond- 
ing to the Cartesian unit vectors i, j, k. There -is an important dif- 
ference between the two sets: the former change their direction as 
we go from point to point. If a vector A is given as a function of posi- 
tion, its respective component# at the point P are understood to be 
its projections on the directions u, v, w drawn at the point in ques- 
tion. The scalar and vector products retain their form in this scheme 
and it is only when we pass to differential operations that the relations 
differ from their Cartesian forms. If the co-ordinate u is allowed 
to increase by du, there is a displacement in the u direction of 
amount 

ds u ~ Ulu, v, w)du u, (100) 

where the function U(u, v, w) changes from one place to another. In 
an individual case its value may be read off from a figure. In the same 


dSg — V(u, v, w)dv v, ds a — W('u, v, w)dv> xs> . (lOOo) 
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The components of the gradient of a scalar function 1(1 ms y be found 
at once as the projections in the u, v, m directions, and these pro- 
jections represent the derivatives dtft/ds in these directions. From 
( 100 ), 

8 '“-* = VS (101 > 

In order to compute the divergence of a vector A we start with the 
definition (40), p. 24, selecting as volume element a parallelepiped 
formed by co-ordinate surfaces. Consider the two sides normal- to u. 
One of these is strictly perpendicular; the other is perpendicular as 
far as quantities of the second order. Only the ^-component A u will 
contribute to the surface integral. At the point u the area of the sur- 
face element is | ds v j | ds a j = (7TF) U dvdw, while at u -f- du it is 
(VW) u + iK dv die, bo that considering the opposite directions of the 
normals, the net contribution of both surfaces will be 

~(AJ W) u dv dw -f (A u VY/) n + e „ dv dm -= du dv dm. 


Forming the analogous expression for the other pairs of surfaces and 
dividing by the volume £ 7YW du dv dw, we have 


div A 


1 [d(A a Y W) d{AJ VU) 
UVW i du + dv'" 


d[AJJVJ\ 


. ( 102 ) 


Substituting the special value grad i ft for the vector A, we have 



In computing the curl we proceed again as in the case of rect- 
angular Cartesian co-ordinates by traversing, one after the other, three 
elementary rectangles lying, respectively, in the three co-ordinate 
planes. It must be remembered that the lengths of two opposite sides 
will differ somewhat. In the surface u — const, we get the contri- 
butions (J.F),, dv and (~A v V) a , +du> dv by going along the v curves and 
the contributions —(A w W) v dw and {A w W) v+iv dm along the w curves. 
Expanding these expressions, adding them and dividing the result by 
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the area, we get the u component of the curl. The other two follow 
by permutation of the letters: , 


1 \d{A w W) 3(4,F)] 
' VW L dv dw 

1 [d(AJJ) d(A a W )] 


-d(A v V) d(A u U) l 
du dv J' j 


The expression for the vector gradient in curvilinear co-ordinates 
is troublesome to derive, and since it is seldom needed it will not be 
deduced here. 


Ex. 22. Compute grad <|>. dir A, Ai[< and curl A in cylindrical co-ordinates 
[z, p, ,<j>) and in space polar co-ordinates (r, <j>, 0). 


14. Degeneration of the Vector Differential Operations at Surfaces of 
Discontinuity in the Field. 

In proving the above relationships of vector analysis, we have 
hitherto imposed the condition that the point 
function be continuous in the region under 
consideration. However, the case often arises 
in physics where a scalar or vector point 
function has different values on two sides of 
a given surface. In this case, the limiting Fig. 27. 

values defined by our differential operations 
become infinite. If, for example, we consider the divergence 

div v — lim v dS, 

this limit has a meaning only if the vector v is continuous within the 
volume element At. Suppose, however, that a surface of discontinuity 
passes through At. Then we may choose as volume element a small 
cylinder whose bases AS are close to the surface of discontinuity, and 
whose lateral surface is perpendicular to it (fig. 27). With continuous 
point functions, the difference v 2 — v x of the values on the two bases 
of the cylinder vanishes as the altitude of the cylinder is reduced to 
zero; here, however, v % — v x becomes the difference of the values on 
the two sides of the surface of discontinuity. If we divide the integral 
<j) v dS, taken over the surface of the cylinder, by the volume At, the 
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result is infinite, for At vanishes to a higher order as the cylinder 
is made smaller. If wo wish to obtain a limit in such cases, we 
must divide by the area of the base AjS rather than by the volume 
of the cylinder. The resulting limit is called the surface divergence 
(Div v ): 

Di v v h'm v (105) 

If we introduce as unit normal vector a vector of length 1 pointing 
from side 1 toward side 2, then, with a sufficiently small surface element, 
we have for the lower side 

J vdS = ~nv , AS, 

and for the upper side 

j v dS — +nv 2 AS. 

The negative sign is to be used on the lower side, since the normals 
are to point outward in forming the surface integral, but. the vector 
yt was taken to point inward. The contribution of the lateral cylindri- 
cal surface can be made an infinitesimal of higher order by reducing 
the altitude of the cylinder. We thus have, as a limit, the contri- 
butions of the bases of the cylinder, and so 

Div v — lim ~ n (r> s — v t ) AS = n (v z — v x ). . (106) 

Since the definition by means of the limit of a surface integral divided 
by the enclosed volume is not confined to the divergence, but repre- 
sents the general co-ordinate-free definition of the V-operator ; we 
obtain for surfaces of discontinuity, in analogy with the ordinary 
V-operator, a reduced operator — the u surface nabla ” — denoted by 
the sign ||. Its particular meaning depends upon whether it is applied 
to a scalar- or a vector-point function, and will be made clear by the 


following: 

Surface gradient: || u — Grad u — n(u 2 — «j). ... (107) 

Surface divergence* || v = Div v — n(v 2 — v t ). . . . (108) 

Surface crud: [|| w] = Curl v — [ n(v 2 — i> t )] (109) 

Surface vector gradient: || • v Grad ■ v — n ■ (v 2 — v t ). . (110) 
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If a surface of discontinuity passes through the region of integration 
F, the theorem of Gauss must he supplemented by a term corre- 
sponding to the surface divergence at this surface of discontinuity. 
This is done as follows: A closed surface 13 drawn am-rounding the 
surface of discontinuity closely (fig. 28). This separates the latter surface 



Fig. 38 


from the region of integration. Gauss’s Theorem is now applied to the 
remaining volume, remembering that the new surface F t , as, well as 
the outer surface F v contributes to the surface integral: 

<j) V dS j) V dS — f V dr. 

If n is again used to indicate the unit normal vector of the surface of 
discontinuity, pointing from side I to side 2, then 

(ft t> dS = 3 = j MVydS — J nv^dS — — j Div vdS, 

whence j) vdS ~j div vdr -j- J Div vdS. . , (111) 

How is the reduced nabla operation to be performed on a product 
of point functions which are discontinuous on a given surface '? If the 
calculational rules of § 12 (p. 38) are formally applied, here, what 
moaning is to be assigned to a quantity with subscript c? A simple 
manipulation makes this clear. We have 

n{u z v 2 — wpty = [~‘ 2 L ^ S (“» ^ — %)J • 

If we designate the average values (i’ x -f- u 2 )/2 in the custotu&ty way, 
by a horizontal bar over the symbols, then 

Grad uv — || uv c + !l m„u -- nv,(u 2 — ■ Mi) -j- #*%(«g — ty). (11?) 
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The subscript c on a quantity standing after the surface nabla thus 
signifies that this quantity is to be replaced by the mean of the values 
on the two sides of the surface of discontinuity. In this way all the 
calculational rules of § 12 (p. 38) may be applied formally to the 
degenerate differential operations. 

15.* Fundamentals of the Matrix Calculus.* 

On p. 33 the coefficients a ik fulfilling certain transformation con 
ditions were designated as tensor components. It proves useful in many 
branches of physics to introduce quite general ar rays of such coefficients 
as mathematical quantities. An array of this kind need not be square; 
it may have m rows and n columns, and would then be called a matrix 
of order m, n. The individual coefficients a ik are called elements of the 
matrix. The entire matrix is designated by a bold-face letter: A. 

Two matrices are aqual when and only when their corresponding 
elements are identical. Thus a matrix equation replaces rnn numerical 
equations. A matrix having all its elements equal to zero is called a 
null matrix, 0. A square matrix having only its diagonal elements 
' different from zero is called a diagonal matrix. If all these diagonal 
elements are equal to unity the matrix is called tho unit matrix, E. 

A square matrix is said to be symmetric if each element is equal to 
the one which is in a symmetric position to it with respect to the 
diagonal, that is, if a ik = a ki . A square matrix is said to be anti- 
symmetric (or skew symmetric) if each element is opposite in sign to its 
“ reflection ” in the diagonal, that is, if a ik = — a fcs and further, if the 

diagonal elements are zero (a H = 0). The matrix A formed by inter- 
changing rows and columns of A is called the transpose of A. A sym- 
metric matrix is, then, its own transpose. 

As in the oase of tensors, which are merely a special kind of matrix, 
the sum of two matrices is defined as the matrix in which each element 
is the sum of the corresponding elements of the separate matrices. 
Logically, the product of a matrix by a number n is then taken to be 
the matrix, each of whose elements is fi times- that of the original 
matrix. 

Up to this point we have given merely an alternative way of writing 
the results of p. 33. The matrix calculus becomes really fruitful when 
,we define the product of two matrices. This permits us to replace long- 
winded summation formulas by a new symbolism. We start again with 
a linear transformation which, through the matrix A, gives us the 
variables z k in terms of the variables y k . If, in turn, the variables y k 
are given in terms of x k through the medium of a matrix B, then the 

* Although the matrix caloulua will be used very little in the text, a brief acoount 
of the basic principles is given here because of the increasing use of this branch of 
mathematics. 
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z k are themselves linear functions of the x k . It is readily computed 
that the coefficients e <k are given by 

Cti — • (US) 

This means thr.t the matrix element c {le is obtained by multiplying each 
element of the ith row of A by the corresponding element of the fcth 
column of B and adding the products. It is logical to designate this 
combination as the product of the two matrices, since it can be shown 
readily that it has the property of distributivity 


A(E + C) = AB + AC .... (114) 
as Veil as of associativity 

A(BC) = (AB)C (115) 


On the other har^, like the vector product, it does not possess com- 
mutativity: The result is quite different from the. previous if the z k 
and the y k are related by the matrix B, and the y k and the x k by the 
matrix A. 

It may happen that the product of cwo matrices is independent of 
their order of multiplication; the two are then said to be commutative ; 
for example, the unit matrix E is commutative with every matrix. 
The product of any matrix with the null matrix Q is always the null 
matrix itself. On the other hand, the product of two matrices, neither 
of which is the null matrix, may happen to be the null matrix; it is 
not proper to conclude that if the matrix product vanishes one of the 
factors must be the null matrix. .In view of the rule for forming the 
elements of the product matrix, we have 

AB = BA (116) 

The definition of matrix product is not confined to square matrices. 
The procedure designated by (113) can.be carried out as long as the 
number of columns 4>fi the first matrix is equal to the number of rows 
of the second. Thus matrices of order m,n may always be multiplied, 
by those of order n,r; the product will be a matrix of order m,r. This 
permits us to write vectors as single-column or single-row matrices, e.g. 

(117) 

The transformation prescribed by a matrix A can be written 




= A r. 


(118) 
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The product of a matrix A of order 3,3 with the matrix r of order 3,1 
is a matrix of order 3,1 and so is again a vector - 

If the determinant formed from the coefficients of a given square 
matrix is different %>m zero, then the matrix which represents y k as 
functions of x k may be solved for the x k as functions of y k . The new 
scheme which results is the inverse, or reciprocal l, of the original matrix 
and is designated A -1 . Since application of A to the x k gives us the y k , 
and application of A -1 to the y k again gives us the x k , it must be true 
that 

A- X A r= A A 1 = E. ..... (119) 


The second equality may be verified by starting with the y k . Since the 
inverse of the inverse matrix must again be the original matrix, 


(A* -1 ) -1 = A. ....... (120) 

Moreover, 

(AB)- 1 = B -1 A -1 (121) 


This is true because if we multiply both sides of the equation by AB, 
E = ABB 1 A -1 a AA- 1 — E, . . . . (122) 


on account of the associative law. 

An orthogonal matrix is one in which the sum of the products of 
corresponding members of two rows has the following properties: 


]i,a ik a lk = 0 for t ^ 

= 1 for l = ij' 


(123) 


The definition may be- expressed in another way. The product sums 
defined above represent the product of the matrix A with its transpose 
A. If a matrix is orthogonal, then 


(124) 


However, this is also the definition of the matrix A -1 which is re- 
ciprocal to A. Thus an orthogonal matrix has the property that its 
transpose is the same as its reciprocal. An example of an orthogonal 
matrix is th4 set of direction cosines of two rectangular co-ordinate 
systems that differ in orientation (equation (64), p. 33). 

The matrix calculus may be extended to include matrices with 
complex elements. The transposed rhatrix is then one where, in addition 
to the interchange of rows and columns, each element is replaced by 
its complex conjugate. A symmetric matrix then corresponds with the 
so-called Hermitian matrix, and the orthogonal matrix, because of 
OU= E becomes identical with the unit matrix. 
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Ex, 22 a. The question posed in Ex. 18 (p. 37) may be formulated in matrix 
notation as follows: Given an n-dimensional symmetric square matrix A. Find 
the vectors r (single-column matrices) for which 

A r =•= Xr or (A - XE )r =■ U. 

Show that (a) this leads to an equation of nth order in X; ( h ) the r corresponding 
to various values of X are mutually perpendicular (since it is only the ratios of the 
components of r that enter, their magnitude may be taken equal to unity); 
(c) the matrix formed by using these vector components as rows is an orthogonal 
matrix, 3; (d) the matrix S -1 Af! is a diagonal matrix with the roots X,- occupying 
the diagonal. 



CHAPTER H 


Mathematical Represent atic n oe Periodic Phenomena; 
Theory of Vibrations and Waves 

L Simple Harmonic Vibrations. 

The simplest temporally periodic process (vibration) is described 
mathematically by the sine or cosine function. If the process repeats 
itself v times per second, the function is either 

u = A wh2ttvI or u—Acoa2m4. ... (1) 

A is called the amplitude and v the frequency of the vibration. Re- 
sides the frequency v, we introduce the angular frequency to — 2nv 
to simplify the notation. A phenomenon described by a simple sine 
or cosine function is not only the simplest vibration mathematically, 
but physically as well. It is called a simple harmonic miration. 

The computations are considerably simplified by using imaginary 
exponentials instead of the trigonometric functions with their cumber- 
some addition theorems. The connexion between the two types of 
functions is given by tho Euler Formula 

e*“ l — cos cat + i sincoi. ..... (2) 

This notation also leads to an extremely important representation of 
a vibration, using the complex plane (cf. Chap. Ill); for 

z = Ae M (3) 

represents a complex number whose representative point, or index, 
as we shall call it, describes a circle of radius A with angular velocity 
co. The projections on the real and imaginary axes are respectively 
x = R(z) a* A cos cut; y = I(z) — A sin cut. . . (4) 

But since physics deals with real quantities, the final results of a 
computation using complex numbers must admit of translation into 
real magnitudes. This is very simple'; an equation involving complex 
numbers means that the real as well as the imaginary parts satisfy 
the equation. We can therefore take either the real or the imaginary 
part of the equation as the physical statement. 

The square of the amplitude is often of importance. It is most 

M> 
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readily obtained from the complex representation by multiplying the 
vibration function by its conjugate: 

4 a =*z. ....... (5) 

The great advantage of using complex numbers manifests itself in 
the case where two vibrations of the same frequency, but of different 
phase, are to be combined. If there is but- one vibration function, the 
point at which we begin to count the time is of no consequence, and 
so we write this function in the form (1) or (3). If, however, we are 
required to add a second vibration to this one, the new oscillation wiil 
not, in general, attain its maximum value at the same instant as the 
original one, but at a certain interval before or after. In this case 
we say that there is a difference of phase 8 between the two vibrations, 
ffn the real representation, then, 

u L = A 1 cos a>t, w 2 — A 2 cos (cot -f- 8), 


and the resultant of the two oscillations is 

% + it, = A x cos wt 4- A 2 cosS eoBtot — A 2 sin8 sinrni. (6) 

The right member may be written as , a single cosine vibration by 
transforming the coefficients of sin ait and coseii so that they assume 
the character of the sine and cosine of an angle <f>, multiplied by the 
same coefficient: 


{Aj-\-A 2 cos S) cos art —A 2 sin S sinait 

, i AjA-AoCosS 

—V (A 1 -}~A 2 cos hf+Af sin 2 8 - 777' -- . S 2 1 1 . ■ cosait 

1 I V( Aj.4- A & cos S) 2 4~ Af sin K 8 

.4 2 sinS 

V ( Jj4 -A 2 cos S) 2 +A 2 2 sin 2 8 Sm< 

P]it A x 4- A 2 cosS 

V (A x 4- A 2 cosS) 2 4- Af sin 2 8 

A 2 sin8 

V(A X 4- A 2 cosS) 8 4- A 2 2 sin 2 l 

Then 4- % = V 4- A 2 2 4- 2 A X A 2 cos 8 (cosait 4~ 4) 

= A cos(arf 4- 4)- (10) 



. . (7) 

= COS 4', 

• • (8) 

= sin</>. 

■ ■ (9) 


According to the expression under the radical, the amplitude A of the 
resulting vibration iB given as the third side of the triangle having the 
amplitudes A x and A 2 as adjacent sides, and 8 as exterior angle (fig. I). 
The difference in phase 4 between the resultant vibration and is 
the angle between the sides A and 14 of this triangle. 

This construction corresponds exactly with the addition, in the 
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Gauss plane* of two complex numbers, which add like vectors. To 
obtain the resultant of two vibrations having equal frequencies, but differ- 
ing in phase, add the corresponding 
complex numbers. The amplitude, 
and phase of the resultant are given, 
respectively , by the length and direc- 
tion of the radius vector representing 
this sum , 

In electrical engineering, where this method is most frequently 
used, this representation is referred to as the “ vector diagram ”, on 
account of the way in which the quantities are combined. It must be 
remembered, however, that the complex numbers by means of which 
the vibration phenomenon iB represented are not vectors in the physical 
sense; the letter are spatial, and are not confined to a plane. For this 
reason, it would be more correct to speak of the “ index diagram ” 
than of the vector diagram. This diagram may be drawn for any 
instant of time; but smce the phase difference is constant, the entire 
triangle rotates as a rigid figure with the angular speed w, as the 
time advances. We may therefore choose any position — for example, 
the real axis — for the line of the first vibration. 



If more than two vibrations are to be compounded, we obtain the 
index of the resultant as before by laying off the individual indices in 
succession — like vectors — connecting the initial point of the first 
index with the terminus of the last. A case occurring frequently in 
practice is the combination of a large number of vibrations of equal 
amplitude, there being a constant difference of phase A <f> between 
successive vibrations. The vector diagram then becomes a portion of 
a polygon, which approaches a circular arc as the individual amplitudes 
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decrease (fig. 2). The phase difference between the first and the Nth 
vibration is (N — - 1)A<£, which may be written N A <f> if N is large. 
The greatest value of the resultant amplitude is Na, obtained when 
A<£ = 0. As &4> increases, the- broken line becomes a circular arc of 
greater and greater curvature. If the arc closes te form a full circle, 
i.e. if A <f> = 2tt/N, the resulting amplitude vanishes. As the phase 
difference further increases, a small arc remains after the circle is 
closed. This arc again yields a finite value of the resultant, until, at 
A <f> = irr/N, a second zero point is reached, and so on. 

For arbitrary phase differences A</> we obtain, from fig. 2, a 
resultant amplitude 



sin N^ 

where rN A<£ == Na, i.e. A = Na ' . (U) 

2 ~ 


or, putting N A(f> — <f>. 


A 



(IF) 


The representation by complex numbers also gives a simple con- 
struction for the derivative. If 


then 


2= A&“\ 

da . . ■ 

— tatAe?" — noz. 


This is a complex number obtained by multiplying z by to and turn- 
ing it through an angle it/ 2 in the counter-clockwise direction. Dif- 
ferentiating again, 

^-5 = —a> i Ae‘“‘ = — ojh. 


This is a complex number whose index again falls in the line Oz, but 
m the opposite direction. We see, in addition, that a simple harmonic 
vibration satisfies the differential equation 


W 


+ 


< 0*2 — 0 , 


(12) 


which is called the differential equation of an (undamped) simple 
harmonic vibration. Similarly, the indefinite integral 
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is found by rotating the index z through an apgie w/2 in the clockwise 
direction, and dividing by to. 

2. Representation of more complicated Periodic Phenomena by Series 
of Harmonic Terms. Fourier Series. The Fourier Integral. 
Let a purely arbitrary process, which is repeated v times per second, 
be represented by u — f(t). That is to say, we have 

/(*+;) = /( 0 . (13) 

Let us make the proviso that f(t) is not to pass through infinitely many 
maxima and minima within the time of one vibration, and that the 
integral jf(i)dt is to converge even if the function has a finite number 
of infinite values in this range. A finite number of discontinuities is 
also permissible. Under these conditions the function f(t) may be 
represented over a complete period, and thus from t — — oo to 
t = 4: oo » except at the discontinuities, by a series of simple harmonic 
functions, the frequencies in which are integral multiples of the funda- 
mental frequency. Such series are called Fourier Series, after their dis- 
coverer. For a proof of the possibility of developing /(J) in a Fourier 
suries under these very general conditions, we must refer to textbooks on 
mathematics. Here we shall give only the method of determining the 
coefficients. It is most convenient to start from the complex repre- 
sentation and write 

/(Q- « 0 + <h#“' + + • • • -f • ■ • 

+ + a_ 2 e~ 2 ‘“ < -f . . . 

= S a B e"*“‘. . . . (14) 

Since the left member is real, the coefficients of the series on the right 
must be such that no imaginary terms occur. To determine o 0 , we 
integrate both sides over one complete period, i.e. from 0 to 2w/u>. 
Because of the periodicity of the complex exponential function, all 
contributions from the terms on the right vanish, with the exception 
of a 0 , for 

r*” 1 " a r I 2 '/" a 

f a n e-‘“‘ dt — —A = — («*•* - e°) = 0. (16) 

Jo nuo[_ nun v ’ 

On the other hand, o 0 is determined by the equation 

jf* /(() c& = ~a 0 , or a 0 = — jf fit) dt =- f^j * (16) 

the usual uotetjou for the mean value of /(»). 
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To determine the other coefficients, we multiply both sides by e-” u, ‘ 
and integrate, as before, from, 0 to r/w. Again, all terms on the right 
cancel out (because of the periodicity of the imaginary exponentials) 
except the o n -term, which contains no exponential factor, and which 
gives the value (2 irjoj)a„ on integration. Thus we have 



a »-£f 0 /w*^**, . < 

• • • (17) 

and similarly 


. . . (18) 


The reduction to the usual real form of the Fourier series, 

/ (t) — eoswi-j- . . ,b„ cos n tot -f- . . . sin wt . . ,c„ sinwwf-|- ... 

= £ b n cosnwf + £ c* sinnwf, (19) 


is very simple. If the real and imaginary parts of each term in the 
complex form are separated, and the terms in a„ and a_ n are paired, 
then we have 

o„e’ B " < = ~ (cos n wf -f- i sin n wt) ] 


| J f(t) cosncotdl, — ij f (t) sinnwf d<|; 


— — (cosnwf — t sin r wt) 

2ir 

| f* /(f) cosnwf df + t^ /(f) sinwwf df|; 


( 20 ) 


a n e” int + a_ n e -m, ‘'' - cos ruoi J f ( t ) cos n wf dt 

-f- — sin wait j"' /(f) sin wait dt; 

w -2ir/o w -Zirlcu 

b Q — f (f); 6„ === - j /(f) cos nwf dt-,c„ — - j f (f) smnwf dt. 


It is to be noted that the summation extends from n = — oo to 
n ~ -f- 00 in the complex representation, but from 0 or from 1 to +co 
if the real representation is used.* 

By, inserting the expressions for the coefficients, we can represent 
the Fourier Series by a single summation. However, it is necessary to 


* In most physical problems /(t) is not given by an analytical expression, but by an 
esnpirioal curve, so that the integrals must be evaluated numerically or graphically. 
Special computing methods as well as mechanical devices called harmonic analysers 
have been worked out for this purpose. 
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assign a new symbol for the variable of integration, since otherwise 
t would be used for two different purposes. If the integration variable 
is called a, then 

=ys„tfh a) ^ it ~ ayda ‘ ■ • • (2i) 

We introduce the time of one complete vibration — the period 
T — 2tt/u>. Also, on account of the periodicity of the function f(t), 
we can take the range of integration from — T/2 to +Tj 2 instead of 
from 0 to T. Then there results 

b-+oo 1 r + TI 2 2«»\ , 

/«)= S U . . (22) 

n— « 1 •'-TV 

At this point we can pass to non-periodic functions, which may be 
looked upon as extreme cases of periodic functions whose period 
extends from — oo to -foo. If we put IjT = As, then 
»-+«> r +TH 

/(<)= 2 As/ f(a)e^~^da. . . (23) 

« J -m 

Now the definite integral J </>(s)ds is defined as the limit, for As 
infinitely small, of the sum 

2 <£(nAs)As. 

Also, J <f>(s)ds — J <£(s) ds -f- J <f>(s)ds— 2 <f> (n As) As. (24) 

From this result it follows that, as T grows beyond all bound, the 
expression (23) passes over into the Fourier Integral 

f(t) = f + *ds J +< f(a) e 2 ”**-— *) da = f + ”e 2 ™‘ds J + f{a)e~ z ^dot.. (25) 

The second form discloses the meaning of this identity: the non- 
periodic function / may be expressed by a continuous series of har- 
monic periodic functions whose amplitudes aro given as functions of 
the sequence number s by 

£/(a)e— do. 

The possibility of such a representation is of great importance in the 
analytical treatment of functions which are otherwise not expressible 



THEORY OF VIBRATIONS AND WAVES 


67 


II.] 

by a unified mathematical expression. Let there be given, for 
example, a simple harmonic vibration which, however, is not of 
indefinite duration, i.e. we suppose it to start at t — 0 and cease art: 
t = T. Such a vibration cannot be represented by e*™*, since this is 
the expression for a vibration existing from t ~ — ao to f = +oo. 
We can, nevertheless, insert the function /, so defined, in the Fourier 
Integral and determine the distribution of frequencies therein. This 
distribution has, naturally, a maximum for e— v. The Bhorter the time 
the vibration lasts, the wider the “ frequency band ” covered by it. 

By taking the quantities in pairs and applying the Euler Formula, 
as in the case of the Fourier Series (p. 55), we can reduce the complex 
form of the Fourier Integral to a real form: 

/(*) = 2f" d$J + f{a)co&2irs(t — a)da. . . (26) 

Ex. 23. Develop, in a Fourier Series, the periodic function which has the value 
0 from 0 to T/2, and the value 1 from T/2 to T, then 0 once more, and so on- — 
sometimes called a “ square wave 

Ex. 24. Express e^'costai by a continuous scries of undamped harmonic 
vibrations, and find the frequenoy of the component vibration for which the 
value of the square of the amplitude is half the .maximum value. 

3. Modulated Vibrations and Beats. 

In wireless telephony we deal with “ modulated ” vibrations in 
the transmitter. These are oscillations in which the peak amplitude 
itself is a periodic function of the time. The amplitude, however, 
changes slowly compared with the frequency of the actual vibration. 
The latter vibration- -the so-called “ carrier wave ” — has a frequency 
of the order of 1,000,000 sec -1 , while the frequency of modulation — 
which is the frequency of the radiated tone — is around 600 sec -1 . 
Using real functions, a modulated frequency of this kind may be 
represented by 

u = A coscaj costojt, (m 8 >«».). . . . (27) 

But, by a familiar formula of trigonometry, 

2 coso >jt coizcorf = cos ( cj 2 o> x )f cos (o> 2 — . (28) 

The use of this formula enables us to write (27) in the form 
A A 

u = _ cos(m 8 -f -f ^ cos(m 2 — , . (29) 

This signifies the combination of two vibrations of equal amplitudes, 
and having frequencies <o 2 -f- oq and o> 2 — co v which are very close 
together. In practice, this type of vibration may be produced in 
either of these two ways, i.e. by modulating a carrier frequency, or 
by combining two vibrations which differ but little in frequency. The 
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Latter case, because of its well-known acoustical manifestation, is 
known as the phenomenon of beats. The angular frequency or the 
beats is u> 2 ~ u>,_. It is to be remembered, however, that the phenome- 
non itself is correctly represented by either (27) or (29). 

The ooe-mrenoe of boats represents the simplest case of an alrmsi-ptriodie 
phenomenon. If, for example, <*>j and co 3 aro not commensurable, then, strictly 
speaking, the function u has no definite period, i.e. there is no fixed interval T 
such that the value of v, at any time t is repeated at time 7. Nevertheless, 
within a certain latitude of departure e, a “time of recurrence” may be defined. 

4. Combination of Yihsstions along different Axes. Lisaajous’ figures. 

Thus far we have considered perfectly general periodic functions u, 
which could be represented either in a co-ordinate system with t as 
abscissa and u as ordinate, or in the Argand diagram of the complex 
plane. Now we wish to give our function a particular meaning: let u 
be the co-ordinate a: of a point moving along the £-&xis. If, now, the 
other co-ordinates {components of the radius vector) are also periodic 
functions of the time, we inquire the nature of the curve described 
by P, the extremity of the radius vector. Wo restrict ourselves to the 
important case of motion in a plane, the 2 -co-ordinate being omitted. 
The motion in the ay-plane is to be distinguished, of course, from 
rotation of the index in a vector diagram. If we desire to employ the 
latter representation here also, we must introduce a complex plane 
with rotating index for each co-ordinate. For the discussion of the 
orbit, we thus find it simpler to start from the real representation and 
write 

X — a costal, ysssb cos (art — 8). ... (80) 

It is thus specified, for the present, that the two vibrations have the 
same frequency. We obtain .ridely different vibration figures for 
differing values of the phase difference 8. First, let us take 8 * 0. In 
this case, the point remains on the straight Hue 

y.^o 

x a 

which is the diagonal of a rectangle with sides « and b. This morion 
is called the linear form of vibration. If we next take 8 between 0 and 
«/% elimination of t in (30) gives the equation of the orbit as 

y = b | cos S + b I - g sin 8, 

y cosS^ — 6^1, — ^sin*S. . . (SI) 

This is the equation of a second degree curve. From the parametric 



THEORY OF VIBRATIONS AND WAVES 


II.l 

form (SO), it is evident that neither the x- nor the j'-co-ordin&te can 
become infinite, i,e. the curve most be an ellipse. Its principal axes 
may be determined by the methods of analytical geometry. For 
3 = tc/ 2 we have 

x — a cob cut, y = b sinc-g. 


This, however, is the familiar parametric representation of the ellipse 
a* r b*~ h 


whose principal axes are the co-ordinate axes. 

The direction m which the ellipse (30) is described may be seen 
from the parametric representation. If we start at t — 0, then P is at 
its greatest distance from O in the direction of the ?;~axis, i.e. the * of 
P is a maximum. As t increases, x decreases, but (if 8 is less than j?) 
y increases in the direction of the positive y-axia. This form of vibra- 
tion is called a left-handed elliptic vibration. If S is allowed to increase 
to the value v, 

, y b 

x — ® coscuJ, y = — b costef, - — . 

J x a 


This is the original linear vibration reflected in the y-axia. Assigning 
8 the value 3 nj'i gives the same ellipse as for 8 = w/2; but this curve 
is traversed in the opposite sense, and we 
have a case cf right-handed, elliptic motion. 
We again obtain the linear motion with 
which we started by putting 8 •= 2w. The 
intermediate ellipses may be drawn with the 
aid of the above considerations (cf.. fig. 3). 
If the amplitudes a and 6 are equal, we have 
the special case of motion in a circle. This 
may be either light-handed or left-handed. 

If we employ exponentials instead of trigonometric functions, 



2C — <.46*'”*, | »- 

y~ jj 2 = fotf”*-*). f 


(32) 


This shows that the ratio of the components of the radius vector is 
real only when the phase difference is 0 or n, i.e. when the vibration 
is linear, for 


* *i 


{33) 


Thus the complex number representation provides a simple criterion of an 
elliptic vibration : the ratio of ike eomp;ments x and j is complex. 

If the two frequencies are uot equal, the phase difference is eon- 
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stantiy changing. For the case where the frequency difference is 
small compared to the individual frequencies, the phase relation 
changes slowly, and we can write 

x~ a cosa^J, 1 

y — b cosfa^ -h («2 — a>j)t] = b cos (to 1 t+ -t). j '* ' 

The phase difference 8 thus becomes a linear function of the time, 
and the point P traverses in smooth succession all the elliptical curves 
corresponding to the various values 
of 8. The resulting pattern, called 
a Lissctjous figure , after its dis- 
coverer, has the property of gradually 
covering the entire rectangle of sides 
a and b (cf. fig. 4). The Lissajous 
figure is the simplest example of a 
conditionally periodic system. In this 
type of system — which is of impor- 
tance in Celestial Mechanics and, to a Fig. * 

certain extent, also in Atomic Me- 
chanics — each co-ordinate of P is, in fact, a periodic function of the 
time. The orbit itself is eloBed only when a condition of commensura- 
bility connects the two periods. In this case alone can we speak of a 
periodic motion. 

Since any vector (e.g. the vector representing the electric field 
strength) may be represented- as a radius vector, our considerations 
are perfectly general, and hold for the motion of the terminus of a 
vector, as well as for the motion of a material particle. It is further 
of importance to note that a right-handed and a left-handed circular 
motion of the same frequency and amplitude always combine to form 
a linear vibration, whose axis is determined by the phase difference 
of the two motions. If the left-handed motion iB 



% = a cos tat, \ 
y x — a sinarf, f 

and the right-handed motion is 

3% — ocos(orf — 8), 1 

y t = — a sin(ad — 8),/ 


(35) 

(36) 


then a simple trigonometric transformation gives 


a?x + *2 = 2o cos 5 cos(a>i — 8/2). . . . (37) 


Similarly, 


Vi + V* = 2o sin ^ cos (ad - S/2). 


(38) 
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This shows that the x- and y-components of the resisting motion do 
not differ in phase, and the vibration is linear. The direction of the 
axis is given by division of the two equations: 

^-“4 < 39 > 

This result is still mere easily obtained by transferring the circular 
vibrations to the complex plane, where they are represented by 

Zj = at** and z„ = oe . . . (40) 

The sum becomes 

z l + z i = a#' 1 '* { + «-.(»*-«/ 2) J 

== 2 ae ,t/2 cos (cot — 8/2) (41) 

This is a vibration taking place in a line making an angle 8/2 with the 
real axis. 

This- method, which is always applicable to 
circular vibrations, is not to be oonfused with the 
vector diagram introduced in § 1 (p. 52). There, 
only the real part of the complex number had 
physical significance; in circular vibration, the 
complex number itself gives the correct picture 
of the process. 

The same result is obtained diagrammatically 
by combining the instantaneous radius vectors, 
Fig. s and again the geometry of the figure gives the 

angle 8/2 (fig. 5). 

Ex. 25. Draw, for several values of the phase difference, the closed vibration 
figure resulting from combination of an x- and a y- vibration having the ratio 
of frequencies 1 : 2. 

5. The Propagation of Periodic Disturbances in the form of Waves. 

Hitherto we have confined our attention to the particular point 
in space at which a given vibration is taking place; in what follows 
we shall consider all points of the region. First we take the simple 
case in which motion in a certain direction (call this the sc-axis) is inde- 
pendent of the y- and z-co-ordinates, i.e. is a function of x alone. At 
x — 0 let 

«(<, 0) = Ae<** (42) 

We call a process a wave if the same state of vibration exists at x — x 
as at x = 0, but with a difference of phase corresponding to the finite 
velocity of propagation v of the phase.* That is, at the time f, the 

* * j» algo referred to as the “ wave velocity 
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same condition obtains at sr as held at 0 at the time t — s/t>. We thus 
write * 

u(t, x) — ...... (43) 

In this way u becomes not only a function of the time, but also of 
the co-ordinate z. If we imagine a snapshot of the process to be made 
at a time f c , then the corresponding equation is 

u{t 0 , z) ~ Ae*"‘*e“ ,<: 3. ..... (44) 

The instantaneous value at 0 recurs if z increases by an integral 
multiple of 

2ttv ® 


for in this ease the argument of the imaginary exponential increases 
by 2w, and the function remains unchanged. This distance, which gives 
the separation of the points of equal phase, is ca lled the wave-length X. 
From the above considerations, A obeys the fundamental equation of 
wave theory; 

Xv^vi ...... (46) 

Wave-length x Frequency -- velocity of propagation of the phase. 

A wave of the kind described, in which « depends, in addition to 
the time, only upon the co-ordinate of the direction of propagation, 
is called & plane vMve, sines u is constant in any plane perpendicular 
to the direction of propagation. The simple harmonic plane wave is 
a particular integral of a partial differential equation which is readily 
deduced. If u is differentiated twice with respect to i, and twico with 
respect to x, we obtain 

and |5j«* - - (46) 


so that 


3 % | Wu 

3# tj 8 3 ? 2 * 


(47) 


The general integral of this second order partial differential equation 
most contain two arbitrary functions. As may be verified readily, 
any function f oi (t — */«} satisSet the equation, but so does any 
function g of (* 4- as/ v). The latter consgponds to a wave moving in 
the direction of the negative ss-axis. The general integral is then 

«-/(<- 1 )+ ,(>■ +$) («> 

* in the simplest cose, as taken hew, the amplitude A is independent of the position 
in space. Nevertheless, A may also be a function of x. Thus, in an absorbing medium 
the amplitude fails off as Here, however, we consider only undamped waves 

whioh, In our case of dependence solely on x, have constant amplitude. 
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For a plane wave whose wave normal ft (i.e. the perpendicular to 
the planes of equal phase) is arbitrarily oriented with respect to the 
axes, the planes of equal phase have the equation rn = const., and 
the plane wave is represented by 

. (43') 

If this equation is differentiated twice with respect to t and twice with 
respect to each co-ordinate, we can form the most general differential 
equation defining an undamped wave, of which (43') is only one 
particular integral. This equation turns out to be 


8 *« 3 *u , 

+ dy* ^ 32 * 1 


A u 


l_d*u 
u 2 dt *' 


(49) 


Another very important special case is that of a spherical wave; here 
the function depends only on the distance r from a centre 0. The trans- 
formation of Am to this case — the special case of spherical polar co- 
ordinates when u depends on r only — is not difficult if we use the 
vector meaning of Am, viz. div grad u. We have 

grad u(r) = ~ i r 


, . , .. du 1 ,. rrfdPul du 1\ 

dhi 2 du 
^d^^rdr' 

Then (49) becomes S 2 U 2 du 1 S 2 m 

+ (50) 


A particular integral is 

...... ( 51 ) 


as may be seen readily by substitution, or by wilting the equation in 
the form 


5 («)■ 


1 3 2 
'v* a7 a 


(fM). 


This is a simple harmonic wave spreading equally along radii in 
all directions from the origin. In contrast with plane waves, the 
amplitude falls off as 1 fr, corresponding to the increase of the wave 
surfaces. The surfaces of constant phase are given by r = const.. 
and are therefore spheres. The general solution, consisting of one 
wave emanating from 0 and another closing in towards 0, is 
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« — (!/»•){/(* — r/v) -f g{t + r/v) }. . . . (52) 

Ex. 26. Give the differentia! equation of a cylindrical wave, and find its 
integral. 

6. Combination of several Waves having the same direction of Propaga- 
tion; Linearly and Elliptically Polarized Waves; Group Velocity. 
(a) Phase velocity independent of the frequency; absence of dispersion 
If u at x = 0 is a complicated function of the time, representable 
by a Fourier Series, and if there is no dispersion, then the phenomenon 
is repeated at every point with a phase difference eorresponding to the 
velocity of propagation. In order to represent such a wave, we have 
only to replace t by t — xjv in the Fourier Series. If the quantity 
satisfying the wave equation is of the nature of a vector, we speak of 
pure lortgitudinal waves if this vector is along the direction of propa- 
gation, i.e. the x-axis. In this case everything is cylindrically sym- 
metrical around the x-axis, and there is no uniquely distinguished direc- 
tion in the planes perpendicular to this axis. If, on the other hand, 
the vector has no component in the x-direction, we speak of pure 
transverse waves. If there is no phase difference at 0 between the two 
components of the vector — i.e. if the vibration is linear, according to 
§ 4 (p. 58) — it remains linear at every point, if the phase velocity is 
the same for both components. In this case we speak of linearly 
polarized waves, and refer to the plane determined by the directions of 
propagation and vibration as the plane of vibration. If the vibration 
is elliptic at 0, it is elliptic, under the same conditions, at all points in 
space, and we have an elliptically polarized wave, of which a special 
case is the circularly polarized wave. . 

The case in which the phase velocity is somewhat different for the 
two components is of importance. If we again represent the vectorial 
point function by means of a radius vector in the planes perpendicular 
to the x-axis, 

y— a cos co (^t — 

z—b coso) (t — =* b cosat x( — — — Yj. 

V vj L Vi vjj 

Comparison with formulae (30) (p. 58) shows that the phase difference 
varies with the position along the x-axis, independently of the time 
(cf. fig. 6). Points of equal phase difference, and therefore having the 
same vibration figure (e.g. a straight line), are apart a distance A, 
given by 

cuA^— — — \ = 2 tt or • < 63 ) 

Vi V A A x A, 

we obtain a phase difference varying with the co-ordinate 


Similarly, 
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if we have two oppositely directed circularly polarized waves of equal 
amplitude, but having different velocities of propagation. According 



Fi*.6 


to § 4 (p. 60), two such vibrations always combine to give a linear 
vibration, and the position of the axis of vibration is given by the 
phase difference. If the latter varies from place to place, this signifies 
a gradual azimuthal rotation of the axis of vibration as we move in 
the direction of propagation. 

(b) Dispersion; phase velocity dependent upon frequency 
If the partial vibrations composing a complicated vibrational 
phenomenon are propagated with different velocities, we -obtain in 
general, by superposition of ail these vibrations, a non -periodic con- 
dition, concerning whi.cb little can be said. It is only when the 
partial vibrations are close together in frequency that the nature of 
the result is clear. The simplest case is that of two slightly different 
vibrations of equal amplitude which produce a modulated vibration 
by means of beats. As long as there is no dispersion, the modulation 
(in radio, the audio-frequency note) is, of course, transmitted with the 
same velocity as the carrier wave. If, however, dispersion exists, we 
obtain another velocity — the group velocity — as the following simple 
computation shows: 

Let toj = <o -f- Aco and tq = v -f- ^ Aw. 

Then the modulated wave becomes 
F(x, t) — cos aq (t — -f cos 

We may set aq = to and tq = v in the sum, without appreciable error. 
Then 

F(x, f)=2 cos cos j — ^ t - i * j. 

uni) 
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Neglecting higher terms, 



Thus one obtains 

F(x, t) = 2 cos — 0 coscu (t — 
where, for brevity, we have Bet 

. , i~ d~ . 

I to dv v ___ v _ 1 

v v 1 dm dm dv V’ 

The meaning of this result is as follows: We have a wave of 
angular frequency to ; its amplitude is subject to a wave-like periodic 
variation * of angular frequency Aw. While the phase of the wave is 
propagated with velocity t>, the velocity of propagation of the modu- 
lation has the value V, determined by 


1 

V ~ 


d- 


v 

fa- 


(54) 


It is useful to express the group velocity also in terms of X. i 
(64) may be written 


and since vX = v, 

, dv , dv 
*dx + v ~dX* 

Substituting the value of Mdv/dl) into the above, v 

V « e - X - 
v v * 


The velocity F is, in general, identical with the signal velocity, 
i.e. the velocity of propagation of a signal. If we have a vibration at 
the origin 0 starting at time t — 0 and terminating at t~T, the 
function which represents it is in general non-periodic, but may be 
represented by a continuous sequence of harmonic functions, accord- 
ing to § 2 (p. 56). If the duration T of the original vibration is 
small, we obtain a narrow band of side frequencies, for which, if the 
dispersion is not too great, the derivative dv/dm may be regarded 
as constant. In the same way as a modulated wave results from 
superposition of two slightly different primary waves, a vibration 

* Because the algebraic sign oi the amplitude is of no concern, the frequency is not 
Ace/2 but Ace (of. p. 66). 
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testing for a limited time at any one point results in this case by 
superposition of all the partial .frequencies. The velocity of propa- 
gation of the limited wave train is then given by V. Since signals can 
be given only by initiating or terminating a vibration process, and 
not by individual crests or troughs of an unlimited wave, V is the 
velocity of propagation of acoustical or optical signals, as obtained 
by measurement. 

In the region of anomalous dispersion, the signal velocity, i.e. the velocity 
of propagation of the energy, no longer coincides with the group velocity F, which 
loses ita significance in this instance. 

Ex. 87. How must the phase velocity v depend on v, in order that the group 
velocity F be proportional to the reciprocal of the phase velocity? 

7. Combination of Waves having the same Frequency hut dilferant 
directions of Propagation. Standing Waves. 

(a) Superposition of two warns moving in opposite directions 
Lot — Af^- t ~ x,v ' > 

represent a wave travelling in the direction of the positive sr-axis, 
and let 

U2 = J e <«<f+*/o)+a 

represent a wave of the same amplitude and frequency travelling 
in the direction of the negative ss-axis. The sum is 

- M oos^ij? + (66) 

This result is to be interpreted as follows: 

A^+ti a) 

represents a vibration having the same phase all through space. The 
factor 

/tax , S\ 

c08 (t + 5) 

indicates that the amplitude is a periodic function of The period 
is v/v A. The chief characteristic of a wave, via. the finite velocity 
of propagation of the phase, is entirely absent. For this reason, the 
phenomenon is called a standing wave. Ji the two amplitudes are 
unequal, there exists, in addition to the standing wave, a wave travel- 
ling in the direction of the stronger partial wave, whose amplitude is 
the difference of the amplitudes of the components. 
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(S) Superposition oj two plane wa ves meeting at an angle 

A case arising especially frequently in optics is that of a plane wave 
reflected from a plane surface which we shall take to be the xy-plane 
of a co-ordinate system. Call the angle between tie wave-normal and 
the z-axis a. If we let the wave-normal n fall in the yz-plane, as in 
fig. 7, then, by the laws of optics and acoustics, the normal to the 
reflected wave, will also lie in the yz-plane, and will also make an 
angle a with the z-axis. For the incident wave, the surfaces of equal 
phase are the planes perpendicular to n. These planes have the 
equation rn — const. The equation of a wave moving in the direc- 
tion of the unit vector 

n — (cos «.) i -j- (cos f$)j -f- (cosy) A 

is thus « = A(h( t ~’v) — Ae i "’( c ~~ COS +y ~ 0 -* 

From this equation there follows at once, by putting n — i, the pre- 



vious form of a wave moving in the x-direction. In our case we have 
(cf. fig. 7), .. 

n — (sin a)/ — (cos a) k) ri = (sin a)j -(- (cos a) k. 

By superposition of the two waves we obtain 

-2 

This is a wave travelling in the direction of the y-axis, the wave-front 
being modulated with the period A/cos a. The phase velocity of this 
combination wave is e/aina. For a = 0 we again have the case of a 
standing wave. 



CHAPTER III 


Selected Topics in the Theory 9F Functions 
oe a Complex Variable 


1. Conformal Mapping of one Plane on another. 

Since many problems of theoretical physics may be solved par- 
ticularly simply with the aid of the Theory of Complex Functions, 
we give here a rapid account of some of the most useful results of 
function theory. For rigorous deductions of these results, however, 
the reader should consult textbooks on the subject. 



If the argument of a function, e.g. the sine function, is a complex 
number z = s> -f iy, then, in general, the function will also be a com- 
plex number w = u -f- iv. If we represent the complex number z by 
a point in u plane — the Gauss plane or Argand diagram— and the 
complex number w by a point in a second plane, then the function 
f(z) establishes a correspondence of the points of the w-plane with those 
of the z-plane, and we say that one plane is mapped on the other. 
The function /(z) is said to be monogenic at those points where w is 
finite and where the limit 


Ar-*-0 A 2 


( 1 ) 


is finite and independent of the way in which the point z is approached 
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as A z tends to zero. We now state a theorem: Within a region in 
which w is regular, the 2 -plane may be mapped on the w-plane, with 
the exception of those points where f'(z) = 0, by means of a function 
w =“/(*)> in such a way that corresponding small portions of each 
plane are similar and angles axe preserved (conformal mapping). For 
a proof, we consider a small triangle in the 2 -plane, determined by 
the points z 0 , z 0 -f A x * and z 0 -j- AgZ (%. 1). These points correspond 
to file points w 0 , tv 0 + A t w --- w 0 -f /'(z 0 ) A x z and w n -j- A 2 w = 
Wo + /'(«o)As«, in the w-plane. Thus, the quantities A jte and A s w> 
are obtained by multiplying the complex numbers AjZ and A^z by the 
same complex number f'(z 0 ). But this means that AjZ and AgZ are 
turned through the same angle and elongated in the same ratio.* 
The similarity of the two triangles is thus demonstrated; all angles 


z Plane 

La 1 «« 

|.-i. -i«4- 4« 4x4x4— 

i... 
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Similarly by eliminating y, 



The curves x = const, thus become, in the w-plane, hyperbolas with 
foci at. —1 and +1; the curves y — const, become ellipses having the 
same foci (fig. 2). It is known from analytic geometry that these con- 
focal conics form an orthogonal system. The strip of the 2 -plane 
between x — 0 and x — it maps itself into the entire «;-plane. To 
represent the entire z-plane requires a w-plane of infinitely many 
sheets. 

Ex. 28. Discuss the transformation w = 1/*. 


2. The Cauehy-Rlemann Conditions and the Differential Equation of 
Laplace. 


The real part u and the imaginary part o of w — u + -«? are func- 
tions of the two variables x and y. Nevertheless, two arbitrary func- 
tions u{x, y) and v(x, y) cannot, in general, be considered to be the 
real and imaginary parts of a function of a complex variable. If the 
functions u and v originate in a complex function, they satisfy certain 
special conditions. If we denote by/'(z) the derivative of the function 
/ with respect to its argument, then, since u -(- w ~~/{x -f- iy) 


du 

dx 


, dv 

*dx 


-/'wlj-m 


dy dy 


Comparison gives the Gauchy-Riemann Differential Equations 
du_dv 
dx dy’ 

3u___ 
dy ~~ dx' . 


(2) 


These relationships must hold between u and v, but neither of the 
two functions may be chosen at will. If we differentiate the first of 
the above equations with respect to x, the second with respect to y, 
and add, we obtain Laplace’s Differential Equation 


d*u d^u 
da? + 3 ^ 


= Au « 0. 


.... (3) 


Similarly, by differentiating the first with respect to y and the second 
with respect to x, and subtracting. 
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d*v (Pv 
dx 2 dy % 


Av — 0. 


( 3 ') 


Hence both the real and the imaginary parts of the complex function 
w satisfy Laplace’s equation: Au --= 0, Av = 0. 


3. Line Integrals in the Gauss Plane; the Cauchy Integral Theorem. 

Form the integral J f(z)dz between two points P Q and P x of the 

CU 


curve C in the z-plane (fig. 3). Cauchy’s 
theorem is: The integral along a second curve 
O', between the same points, has the same 
value if the function f(z) is monogenic at all 
points in the region bounded by the two 
curves; or, what is the same thing, the con- 
tour integral (p f(z)da vanishes if the function 10 ^ * 

is monogenic in the enclosed region. To 
prove this statement, we separate real and imaginary parts: 




(u -f it>) (dx 4- idy) — (£ (udx — vdy) -f i <ji (vdx + udy) 

= j)Vds + i f&ds. 

The first integral represents a plane line integral (j) ¥ds if we put 
F*= u and F t = —v. But by Stokes’s Theorem (p. 27), since d S 
has a z-component only, 

^Fis =/curi FiS -//(-|;- f?) dxdy, 
and similarly, for the second integral, 

§ v *> -/ c “ i p * -// (is - *4) 


However, by the Cauchy-Riemann equations (2), the integrands in 
both double integrals vanish identically, which proves the theorem. 
Further, the following theorem may be proved: If the line integral 
along a closed curve C does not vanish, but has a finite value, on 
account of the existence of a singularity of the function in the region 
enclosed by 0, then the same value is obtained by integrating along 
another closed curve O', provided the function is monogenic in the 
region between the two curves. The two curves 0 and O' (fig. 1) form 
the boundary of a region to which Stokes’s Theorem may be applied, 
since the function / (and therefore also u and v) are regular within 
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this region.* It is to be remembered, however, that in applying Stokes’s 
Theorem, the boundary curve is to be traversed in such direction that 
the bounded surface is always on the left. Hence the direction in which 
the inner surface is described must be reversed; or, if the curves are 
traversed in the same sense, 

£~£ =0 - 

If there, is but one singular point z 0 , we may therefore integrate around 
a small circle of radius p, with centre at % In the greater number 
of cases, the singularity at z 0 consists in the function becoming 



Fig. 5 


infinite to a certain definite order »; i.e. a power series develop- 
ment at z 0 contains terms (z — z 0 ) -1 , ...(«— z 0 )~ n . It may be shown 
at once that the value of the integral depends only upon the term 
(2 — z 0 ) -1 , for if we put (cf. fig. 5) 

z — z 0 — pef*, dz — ipe'* d<f>, 

then 

j>(z- z 0 )”dz — J Sn ip n + l e^ n+1V4, d4 = 

On account of the periodicity of the complex exponential function, 
the integrals vanish for n =}= — 1, but for n — — 1. 

.... (4) 

I z — Zo J 0 pe* w 

If the path of integration encloses a singularity z 0 of the function f, the 

* From a strict, mathematical viewpoint it is allowable to apply the Stokes’s 
Theorem only after joining the two curves by a “ cut ”. In forming the line integral 
this out will be traversed twice in opposite directions, hence contributes nothing to 
the integral. By inserting the cut we get a single, continuous, bounding ourve that can 
be traversed in one operation. 



74 


MATHEMATICAL INTRODUCTION [Chap. III. 


value of the complete line integral is 2m times the coefficient of the term 
(z — z 0 ) -1 °f tte ■po'oer series development ( in powers of z — z 0 ), which 
holds in the neighbourhood of z 0 . This coefficient is called the residue 
of the function at Zq. If several singularities are enclosed, it is readily 
shown that the value of the integral is equal to the sum of the separate 
residues multiplied by 2 rri. If the point z—eo is enclosed, the sub- 
stitution z = Ijt, dz — — dt/t 2 is made, which reduces the point at 
infinity to the zero-point. Again the essential coefficient is that of 
the term in sr 1 , for this yields the term in 1 )t upon making the above 
substitution. 

Ex. 29. Using the Cauchy Theorem, evaluate the- integral 



by expressing c<ss<p in terms of e {l t and introducing e»^ = s as a new variable. 



CHAPTER TV 


The Fundamental Problem of the Calculus qf 
Variations and its Solution 

1. Statement of the Problem of the Calcuius of Variations, 

The question “For what values of the independent variable has 
a given function y— fix) a turning value?” is treated in elementary 
differential calculus. A necessary condition is the vanishing of the 
first derivative. Similarly, a necessary condition for an extreme value 
— i.e. a maximum or minimum, or turning value, or extremum-— of a 
function of several variables is the vanishing of all partial derivatives 
of the first order df/dx, df/dy. . . . The following question is much 
more difficult to answer: “ Given a definite integral whose integrand 
is a function of x, y and of the first derivative j i'i 

I —f h f F ( x > y> yO 

for what function y(x) is the value of the integral a m aximum or a 
minimum?” In contrast with the simple extreme-value problem of 
differential calculus, the function y(x) is not known here, but is to be 
determined in such way that the integral is an extremum. In geo- 
metry and in physics problems of this type meet us at every turn. 
The simplest example is furnished by the question: “ What is the 
shortest curve which can be drawn joining two given points?” In a 
plane, the answer may be given immediately: “ The straight line.” 
But if the two points and their connecting curve are to lie on a given 
arbitrary surface, then the analytic equation of this curve — which is 
called a geodesic— is found only by solution of the above extreme- 
value problem, which is the fundamental problem of the Calculus of 
Variations. 

2 Derivation of the Euler-Lagrange Differential Equation. 

We shall now show how the extremal problem of the calculus of 
variations may be reduced to the determination of the extreme value 
cf a known function. For this purpose we consider functions p of x, 
which are “ neighbouring ” functions to y. The function p is arrived 
at as follows: Let e be a small quantity, and let g(x) be an arbitrary 
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function of x , continuous, along with its first two derivatives, in the 
range of integration. Then we introduce into the integral, in place of 
y and y', the neighbouring functions § = y -f- oj and jf' = y’ -p erf. 
We stipulate, however, that these 
comparison functions $ coincide with 
the function y{x) at the end points 
of the range of integration (fig. 1). 

We must therefore require of the 
arbitrary function 17 that it vanish at 
the ends of the interval; for if the 
various curves connecting two given 
points are compared, an obvious con- 
dition is that all curves to be com- 
pared shall pass through these two 
points, and that no curves which do 
not pass through these points s3ja.ll 
be considered. If we substitute the neighbouring functions in the 
integral I, the integral becomes a function of e. We then require 
that y(x) make the integral an extremum, i.e. the function /(e) must 
have an extreme value for e = 0. 

7(e) = J F(x, y-\- eq, 1 / + evf)dx = extremum for e= 0 (1) 

This gives us a simple method of dete rminin g the extreme value of a 
given integral. The condition is 



We expand the integrand function F in a Taylor’s Series, according 
to powers of e, and obtain 

He) — J*^^F(x, y, y’) + e V + «?' ^ + terms in e 2 , e 3 . . . jdr. (3) 

If we differentiate with respect to e under the integral sign, as is allow- 
able, we obtain 

z-jCHT +’>’ fp+*™ !***.«■■ ••}*■ • <*> 

This expression is to vanish for e=0. The terms in e do 
this of themselves. Thus there remains the condition for an 
extremum: 



. ( 5 ) 
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The second term may be transformed by integration by parte: 

The first term vanishes since ij is zero at the limits. Then 

- (thO « 

On account of the fact that r) is perfectly arbitrary, the integral can 
vanish only if the bracketed expression in the integrand is zero. 
This gives the Euler-Lagrange Differential Equation-. 

i. df(s, y, y') _ ZF{x,y,y') 
dx dy ’ ' 

which must be satisfied by y, if y fa to make the integral an extremum. 
The essentially difficult investigation of the nature of the extremum 
{whether maximum or minimum) seldom arises in physics, and need 
not be treated here. On the other hand, the case where F is a function 
of several dependent variables y k and their derivatives is of great 
importance. In analogy with the above, we introduce as neighbouring 
functions 


#1 — Vi + ‘i’ll! • • • V*— 9k+ • • • 5/ — Vfh 


where the rj k are again to vanish at the limits of the integral. The 
integral then becomes a function of the / variables e 1 . . . e f . The 
condition for an extremum is, of course, the vanishing of the partial 
derivatives 


g 2 , . . . €f) 

C~i k 




d_ bf\ 

dx dy' k ) 


dx 


when = e* ...*=€, = 0. 

It follows as before that the coefficient of each of the functions ij 
within the integral sign must vanish, i.e. 


dxdy' k <iy k 


0 


(i-1, 2 ,•••/). 


The Euler-Lagrange equation therefore holds for each of the dependent 
variables. 


The diflerenoe 
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for t smaR, is usually termed the variation of the integral, and is written 
sJ Xl F(x, y, y')dx. 

We then write, instead of 

j* F{x, y, y')dx = extremum, 

the equation expressing the vanishing of the variation: 

$J* l F(x,y,y')dx=* J*'$F(z, y, y'Jdw «= 0, 

for which the Euler-Lagrange equation must be satisfied, just as for (1) above. 

We call S F the variation of F. An important conclusion may be drawn 
from this formulation of the variation problem. Let F be the total derivative 
of a function 0(x, y): 

F(x,y, 

In this case the integral may be evaluated immediately: 

= jf* ^ a») - G(x <» y>^ 

Since the varied function coincides with the original function at the limits, we 
have for every connexion between y and as 

s jf >*= - m=h, Vi) - 8<?(*c y«) = o. 

That is, any arbitrary functional relationship y(x) satisfies the condition that 
the variation of the integral shall vanish, and the extremal problem no longer 
has any meaning. 

Ex. 30. Prove the following: If, in the integral whose extreme value is sought, 
the integrand F does not contain x explicitly, then a first integral is 
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CHAPTER V 

The Mechanics of a Single Particle 


1. The Fundamental Concepts of Kinematics. 

We shall concern ourselves, in this first chapter on mechanics, with 
the motion of bodies whose extension in space may be neglected, and 
which may therefore be re- v 

presented schematically by y+dv 

means of particles, or 
points endowed with mass 
(cf. § 2, p. 84). The posi- 
tion of a particle is indi- 
cated by a radius vector r 
drawn from a fixed point 
0. If necessary, we may 
resolve this vector into 
components. At the time 
t, let the particle P be at 
a point in space specified 
by the vector r. At the 
time t + dt the vector is 
r+ dr (fig. 1). The velocity 
is the vector v resulting from taking the limit of drjdt for dt infinitely 
small, i.e. 

m 



The direction of the velocity vector is thus given by the small vector 
dr having the direction of the tangent to the path. Now, in general, 
the vector v will itself be a function of the time. We obtain a vivid 
picture of the trend of the velocity if we draw the vectors v from a 
new centre O x as radius vectors. Then there is a definite terminus of 
a radius vector v corresponding to every instant of time. If all these 
points are joined, we obtain a new curve, called the hodograph of the 
motion. While the particle P traverses the actual orbit, the “ image 
point of the velocity”, aa defined above, moves over the hodograph. 

* When not explicitly written out, each differentiation with respect to time will be 
indioated by a dot over the appropriate quantity. 
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The acceptation is the limit of the change in v per unit time, i.®. 


Since v is the radius vector of the point on the hodograph, the accelera- 
tion of the particle itself is equal to the velocity of its image point on 
the hodograph. 

The velocity V is said to be uniform only if the vector v is constant. 
This definition requires that the motion be rectilinear. Motion on a 
curve in whioh the magnitude of the velocity is constant— i.e. where 
equal arcs are traversed in equal times — can nevertheless not be Baid 
to have uniform velocity. ' 

We may resolve the acceleration in various ways: 


(a) Tangential and normal acceleration: 

dr dr ds , 

'’“a?''**-**- 


where t is the unit tangent vector (p. 18) and v is the magnitude 
Differentiation gives 


(3) 

of v. 


a 


dv . dids 
dl V ds dt 


= 7» t '+- n 


. • . (*) 


[cf. eqn. (32), p. 19]. Here n signifies the unit normal vector pointing 
toward the concave side of the curve, and p is the radius of curvature. 
The tangential acceleration has the magnitude 


dv __ d% 
dt ~ dt*’ 


(5) 


The normal acceleration, directed toward the inside of the path, is 



From (4) it is evident that the acceleration is always in the osculating 
plane. 

(b) In rectangular co-ordinates: 

r = xi+yj 4- zk, ] 

r= xi+yj+zk, [- (7) 

r = xi + yj+ zk. ) 




That is. 


a„= x, a„ = y, a,= s 
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(e) The resolution in plane polar co-ordinates r, <f> is also of im- 
portance. For this purpose we introduce two mutually perpendicular 
unit vectors e, and e$. These vectors have, respectively, the direc- 
tions of increasing r and increasing <f>, and unlike the vectors i, J, h. 



Ffe,» 


change their direction with time. Since the derivative of a unit vector 
is perpendicular to the vector, we have (cf. fig. 2) 


de,_ d<l> de i 
dt ~~ e * dt’ dt 


e. 


d<f> 

dt’ 


and we obtain, from r = re,, 

r — re, + re, — re, + <j>re 
and, by a second differentiation, 

y = fe, -f re, - f- ifre#- -j- e+ 
= (f — rj>*)e, -f* (<fr •+• 2<j>i)e$. 

Thus 



2. Newton’s Second Law of Motion. 

(a) The law of inertia 

The first of Newton’s laws states that every body tends to remain at 
rest or in uniform motion in a straight tine, unless acted upon by external 
influences. This law, commonly called the Law of Inertia, is by no 
means axiomatic, but represents an ingenious extrapolation of our 
physical experience; for we cannot free a body completely from 
external influences, since the gravitational effects of the earth and 
of the stars am always present. 
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(b) Faroe and acceleration 


11 we wish to alter the state of motion of a body, e.g. a sphere 
resting on a horizontal plane, by the use of our muscles, then a cer- 
tain effort, increasing with the resulting change in velocity, is re- 
quired. This effort, which we call force, is a direct sensory impression, 
and hence not further definable. The direction of the acceleration is 
determined by the direction in which we allow our muscles to work, 
so that force, like acceleration, is a vector. But from common 
experience we know that different amounts of force are necessary to 
produce a given acceleration, depending upon the nature of the body 
acted upon, e.g. a sphere of iron or of wood. Reciprocally, the effect 
of a given force depends upon the nature of the body to which it is 
applied. The direction of the acceleration, however, is the same for all 
bodies. We must therefore assign to each body a scalar property 
which we call the inertial mass m. The simplest equation embracing 
these factors is 


F — ma = 


dv 

dt 


m 


iPr 

di*' 


( 10 ) 


Since we have introduced no factor of proportionality, the unit 
of force is determined by using the fundamental 0.0.8. units, 
i.e. length in cm., mass in gm., and time in sec. The force which 
imparts to 1 gm. of mass an acceleration of 1 cm. /sec. 8 is called 
1 dyne. 

Throughout Newtonian Mechanics, to which we limit ourselves for 
the present, it- is tacitly assumed that the inertial mass m is a constant 
individual property of a body. A more cautious way of writing the 
fundamental law (10) leaves this question open: 

*=I w (,0,) 


The quantity u — mv introduced here is called the momentum New- 
ton’s second law, stated in terms of momentum, is then: The force is 
equal to the change in momentum per second* 

All about us in nature we see changes taking place in tbo stats of 
motion of bodies, without exertion of muscular effort on our part. 
We interpret the cause of these changes as " forces ” which operate 
on the bodies in the same v/ay as our muscles do. The investigation 
of these forces, which may be of a very diverse nature, is an important 
problem in physics. In mechanice, however, we take the force to be 


S A wore complete genoral statement smda: " Sale of change of momentum is -propor- 
tional to the force jrrodr.cmj) it , and this change takes place in the direction in tohich the 
force act*:. " 
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given, and confine ourseives to computing the effects. If several forces 
act on a body, each produces its own acceleration: 

$'i --- ma t . 

But since the resultant acceleration is the vector sura of the separate 
accelerations, and the inertial mass m. is the same for all, 
m a— mil cif ----- 2 P # ~ f. 

We may therefore replace ail the separate applied forces F< by their 
resultant P in order to calculate the acceleration. 

Like most physical Jaws, the Newtonian Law of Motion has the 
form of a differentia! equation. The variation in any interval of time 
is obtained from this equation by continuously repeated application 
of the law, starting with given initial conditions, i.e. by integration of 
the differential equation. The unique power of theoretical physics is to 
be found in the circumstance that the laws of nature can be formu- 
lated in terras of differential equations. The fact that we can deduce 
descriptions of such apparently diverse phenomena as free fall, pro- 
jectile motion and planetary motion from a single law depends on the 
cirfeumstane-e that what happens in a short interval of time in any of 
these cases is dedueibie from the same relation. 

In most cases eqn. (10) (p. 84) must be expressed in terms of the 
co-ordinates appropriate to the problem. Thus we obtain, m general, 
a system of three differential equations. Since the solution of three 
differential equations of the second order contains six constants, a 
complete description of the _ phenomenon is obtained, only if these six 
constants are known. These are uniquely determined by giving the 
initial position and initial velocity, each having three components. 

Ex. SI. A point moves subject to a periodic force F = A sin tot. Compute the 
motion for various initial velocities. 

Ex, 32. The force acting on every mass m at the earth's surface is propor- 
tional to *» and is directed toward the centre of the earth. In a limited region 
we can take the force to be constant, and the earth’s surface to be plane. Inte- 
grate the equations of motion for this case, foi various values of the initial velo- 
city, and draw the orbit and the hodogioph of the motion. {Theory of projectiles 
in vacuo.) 

3. Time Integral and Path Integral of the Force. Work and Energy, 
(a) The time integral 

We can make two different statements for the effect of a force 
acting on a particle for a certain length of time, each of which leads 
to relationships having meaning. First, we can multiply the force 
by the time during which it acts; secondly, we can take the scalar 
product of the force by the vector displacement which it causes. 
If the force itself is variable, then instead- of the finite time interval 
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we have the sum of the products of the instantaneous time values 
of the force by the time element, i.e. the integral / F dt. A similar 
formulation holds for the second product. If we apply the fundamental 
law, the time integral becomes 

-*•-(*) - - <U) 
The time integral of the force (also called the impulse * of the force) is 
equal to the change of momentum. 

The impulse law (11) is especially important for a large force which 
acts for a very short time A/, so that the product VAt remains finite, 
and thus causes a finite change of momentum. Observation of the 
ohange of momentum gives the product FA t immediately, _without 
requiring a knowledge of the way the force varies. Here F repre- 
sents a mean value of the force, viz. the time average. 

(6) The path integral of the force 

If a force F acts on a particle, ^fe work done by F when the 
particle moves through a displacement f dr is defined to be the scalar 
product F dr. 

For a finite displacement along a curve 0, the work is the sum of 
such contributions F dr, i.e. we have, for the work, the integral 

W*=f'Fdr. (12) 

If we substitute for F the equal vector m(d 2 r/dt z ), and introduce the 
time as variable of integration by using dr = ( drjdt)dt , then we 
obtain 

= (^) ] = \m (t> 2 * — V)- . . . (13) 

The entire amount of work done by the force is thus given by the 
difference between the initial and final values of the expression Imr 2 , 
which we call the kinetic energy of the particle. Thus the total work 
done by the force is equal to the difference of the kinetic energy in the 
initial and final states. From the definition of work, and the funda- 
mental property of the scalar product, it follows that a force which 
is always perpendicular to the orbit of a particle can do no work, i.e. 

* In view of equation (11), the term “ impulse ” haa come to be used not only for 
momentum but for the produot m drjdl, although the word originally referred only to 
the left side of the equation. 

f The path differential is called dr instead of ds here to indicate explicitly that the 
position of the partiole is given by the radius vector r. 
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can cause no change in the kinetic energy. Such a force can change the 
direction of the velocity, but not its amount. 

Ex. 33. A coin lies on top of a card which in turn rests on a drinking-glass. 
Why does the coin remain on the card if the latter is slowly drawn away, while 
if the card is snapped away suddenly, the coin falls into the glass? (Remember 
the empirical fact that the friction between solid bodies is independent of the 
relative velocity.) 

Ex. 34. Decide whether or not the fatigue experienced in holding a weight 
with arm outstretched has any connexion with work done. 

4, Conservative Forces; Potential. 

Consider a particle P in a field of force, i.e. in a portion of space 
at every point of which there exists a force F, uniquely determined 
by the co-ordinates. Let the particle P be moved from P 0 to P x (fig. 3). 
We divide forces into two classes, ac- 
cording as the work done in the dis- 
placement does not or does depend 
upon the path taken. This difference 
may be characterized- somewhat dif- 
ferently if we' consider also a return 
path P X P 0 . With the first kind of 
force, the integral along P l P f) has the 
opposite value, since the curve is traversed in the opposite sense. 
If, then, we traverse the closed curve PqP^Pq, the integral vanishes 
if the force is of the first kind. Forces of this type — to which, 
for example, the force of gravity belongs — are called conservative 
forces. If a body is projected vertically upward in a vacuum, 
with a given initial velocity, it will return to the starting-point with 
the same speed; the kinetic energy has not changed, and no work 
has been done on the complete path. The situation is quite different 
if the resistance of air or of any fluid operates. In this case the body 
returns to its point of projection with greatly reduced speed, and the 
integral j) Fdr cannot vanish, according to equation (13), p. 86. 
Forces belonging to this second class are termed non-conservative. The 
names originate from the fact that, in the first case, the total me- 
chanical energy is conserved — there is merely an interchange between 
kinetic and potential energy (cf. below) — while in the second instance, 
a transformation to other energy forms (principally heat) takes 
place. 

When forces are conservative, their discussion can be simplified 
by introducing the “ potential ”. If the work done on a particle by 
the field of force during a displacement is independent of the path, 
then we can assign to every point of space the value of the work done 
in a displacement from a “ standard position ” r x to the point r. 
Thus, besides the vector field of the force (determined by three num- 
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bers), we have a much simple! scalar field. For practical reasons we 
take the negative cf the work done, by the field of force and call it the 
Potential TJ. The change in V corresponding to a small displacement 
dr is dr grad <7.* The work done is then 

dW — Fdr — —dU — — (grad U ) dr, . . „ (14) 
i.e. F = = — grad V. ...... (14') 

Every conservative farce may be represented as the negative of the gradient 
of a scalar prAnt function , the potential. For a finite displacement we 
have, by equation (13), 

W — j' Vdrzss \m (v 2 * — vf) - f (pad V) dr — U l - U a . \ (15) 

Denoting the kinetic energy by T, we have 

T* -f = T l + 7j\ =-- const, .... (16) 

This is the Principle of the Conservation of Energy for mechanics. 
V, which hits the dimensions of energy, is called the Potential Energy. 
Equation (16) thus means'. The mm of the kinetic and potential energies 
is constant for conservative forces. 

This energy principle represents a first integral of Newton’s Second 
Taw, since in (16) only » (i.e. dsjdt, the first- derivative) appears. But 
since the Second Law, being a vector equation., is equivalent to three 
scalar differential equations, we need two additional first order equa- 
tions in order to complete the first- step in the integration of the 
equations of motion. 

With a given field of force it is, as a rule, not difficult to write down 
the potential at once. F may then be found by taking the gradient. 
Mathematically, this means the integration of the system 



to which one must have recourse in case of doubt. 

Ess. ttS. A particle is bound to a fixed centre by an elastic force, j.o. one 
proportional to the distance from the centre-. What is the potential correspond- 


5. Central Forces; th® Law of Areas. 

A central fore® is one whose representative vector always points 
toward a fixed point 0. The magnitude of the force may be an arbi- 
trary point function. Naturally, we choose the point O as origin, so 
that F = f{x, y, z)r. 
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In the ease of central forces, an important theorem may be derived 
immediately. This law furnishes two additional first integrals of the 
fundamental equations. We form the vector product of r with each 
side of the equation representing Newton’s Second Law. Since F is 
in the direction of r, and since the vector product of two parallel 
vectors is zero, we have 



The integral may be given at once: 


for differentiation gives 

r dr dr~\ . T tPrl f tPrl „ 

The vector product [r drjdt] has a simple geometric significance. 
The magnitude of [r dr] is equal to twice the area swept out by the 
radius vector in the time dt (fig. 4). If we divide the area for dt by 

dt we obtain the area swept 

over by the radius vector in 
unit time— the so-called areal 

velocity c. Equation (17) ^ 

thus states that the areal 

velocity is constant for motion 

under a central force. This 

is Kepler’s Second Law of 

Planetary Motion. Butequa- , 

tion (17) says more. If the *- r r -* 

product \r(drjdt)] is equal to F, e- ♦ 

a constant vector, the normal 

to the plane determined by r and dr has a fixed direction, i.e. the orbit 
is plane. For this reason equation (17) represents only two scalar 
equations which, however, together with the energy equation, are 
the complete first integral of the equations of motion. 

The vector m is usually called the moment of momentum or 

angular momentum P, about 0. 


Ex. 36. Verify the following generalization of the Law of Areas: if the force 
acting on a particle always passes through a certain fixed axis, the Law of Areas 
holds for the projection of the orbit on a plane perpendicular to the axis. 

Ex. 37. From the equation of the orbit r ~ /(<p) and the areal velocity c de- 
duce the Law of Force. Use equation (9). Application of this line of thought to the 
empirically known Kepler ellipses led Newton to the discovery of the Law of 
Gravitation. 
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6. Gravitational Forces; Planetary Motion. 

Starting with Kepler’s Laws, at that time known as empirical 
facts, Newton derived his general law of attraction, specifying the 
forces which determine the motions of the heavenly bodies. The 
Newtonian Law of Universal Gravitation states that there is a force 
of attraction between any two masses m l and which is directly 
proportional to the product of the masses and inversely proportional 
to the square of their distance apart. That is 

r-y^. (is) 


Up to this point we have encountered the mass of a body only in 
connexion with inertia; here it appears in an entirely different form- 
as gravitational mass (weight). It does not follow that inertial and 
gravitational mass are identical; i.e. more accurately stated, that the 
ratio of inertial to gravitational mass is a constant (contained in y) 
for all bodies. If we assume the two kinds of mass to be the same * 
the constant y has the value 6*66 X 10~ 8 C.G.S. units. 

If we neglect small perturbations, the gravitational field of the 
sun alone determines the orbits of bodies in the solar system. On 
account of its predominantly large mass M, we can consider the sun 
fixed at the origin of a co-ordinate system. Strictly speaking, we 
should investigate the motion about the common centre of gravity, 
which is at rest (of. Chap. VI). The force acting upon the body of 
nuyui m is 



The corresponding potential is 

y^?, 


(19) 

( 20 ) 


as may be verified by forming the gradient. 

Since the force is central, the orbit is plane. We can describe it 
most simply in plane polar co-ordinates with the sun at the centre. 
The first integrals of the equation of motion, the Energy Principle 
and the Law of Areas, may be written down at once: 

J mv* — y = £ mv 0 z — y (Conservation of Energy), 
r r 0 




(Law of Areas). 


* The equality of gravitational and inertial mass Is a fundamental postulate of 
tile Generalized Theory of Relativity. All experimental attempts to detect individual 
differences with various substances have been unsuccessful. 
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If we wish to describe the course of events in time, we must determine 
r and <f> as functions of t, by integrating again. Instead of doing this, 
however, we limit ourselves to the derivation of the equation of the 
orbit described by the body. For this purpose we eliminate the time 
from the two equations. Using 

««=**+rV (c £ p.88), 4 = % 

this is readily accomplished. From the energy equation, 

4c 8 / dr \ 8 4c 2 2 yM _ , 2 yM 

The variables r and <j> are immediately separable and we have 
- dr 


‘TFfP? 7 ? 


If we introduce a new variable u — 1/r, we obtain 
, 2cdu 


. + 2 yMu - 4c 8 w 8 

r o ) 


This integral is of the familiar arc-cosine form 


J Vo -f- 2 bx — hz? Vh 


1 _,/ b~hx \ 

7h co& ~ WW+7h)' 


If we introduce the abbreviation 



and take the cosine of each member, we obtain 
4c* 



P_ 

I •+• « eos($ -f- C)' 


( 21 ) 


We may measure <f> from the mavirmim r, so that C = -n. This gives 
the familiar polar equation of a conic with origin at one focus. This is 
Kepler’s First Law, which states that the orbits of bodies in the soiar 
system are conics, with the sun occupying one focus. 

The orbit is an ellipse, parabola or hyperbola, according as the 
numerical value of the eccentricity e is less than, equal to, or 
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greater than unity. Putting in the values of e and a, we have an 
ellipse if 

2c / . 2 yitf.yLM* 

y MV V ° r 0 + 4c 2 1 


and a hyperbola if | oti> 0 2 >*- . 

r o 

That is, the orbit it an ellipse, parabola or hyperbola according at, 
initially, the kinetic energy is less than, equal to, or greater than the poten- 
tial energy with sign changed. Thus, for an elliptic orbit, the total energy 
is negative. 

The following additional relations may be derived for the case of 
an ellipse. The parameter of the ellipse 

v = b * «. 0 (i _ 6 *) ( a “ semi-major axis\ 
e a \b— semi-minor axis / 

is equal to 4 cPJyM, by equation (21). Hence 

l e a = ^ . .. 

* yMa 

If we denote the period by T, the area of the ellipse is 
cT= wa 2 vT=^7*= -7~- 


This is Kepler’s Third Law: The squares of the periods of the planets 
are proportional to the cubes of their orbital majot axes. 

If instead of using the parameter p to compute 1 — € 2 we use the 
value of the eccentricity itself from.(21), we have, on setting the two 

equal: 

4 o* __ _4c^ / , _ 2 yM /i¥ 2 \ 

yMa yW 2 Y° r 0 + 4c 2 / 

or 

w-rM? ... fas) 


This means that the total energy in an elliptical orbit is a function only 
of the size of the major axis, and all ellipses having the same major axis 
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(including tie circle) have the same energy. This proposition amplifies 
many computations in the older atomic mechanics. Tne total energy 
has a negative sign because of the dominating magnitude of the poten- 
tial energy, which was so specified upon introduction of the potential 
that it vanishes at infinity but is negative at finite distances. 

7. Quasi-elastic Forces and Harmonic Vibrations. 

If a body is bound to a fixed point by means of a spring or a rubber 
band, any small displacement from the position of rest calls forth a 
restoring force proportional to the displacement and directed toward 
the position of rest, which we take as origin. That is, 

F = — hr. 

We shall call all central forces of this form quasi-elastic forces, even 
though, as may well happen, the force is not of an elastic nature (cf. 
Ex. 38, p. 94). The differential equation becomes 

m ^ + hr — 0. ...... (24) 

We might now, as before, use the Principle of Energy and the Law of 
Areas to get first integrals, but direct integration, twice repeated, is 
just as simple here. If we try the solution 

r=ae?", r=AW*, 
wo obtain for A the “ characteristic equation ” 
roA* + k « 0, 

whose roots are 

A — i\l — and A = — * A /-. 

V m V m 

The general solution, containing two constant vectors, is 

/-t/- , _m/I, * 

r= ae v » -f be * (23) 

Using real functions (cf. p. 47), this is 

r~ Acos^^i-J- . . . (26') 

The meaning of the vectors A and B is readily ascertained. For 
t= 0, 
r 0 = A, 
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(£)._-«*- v/s _ 

i.e. r = r 0 cos<J~t+ v 0 ^j~ sin ^ t. . . (25") 

The vector r is thus the resultant of two vectors which have the 
directions of the initial radius vector and initial velocity, respectively, 
and whose magnitudes are simple periodic functions of the time. To 
obtain the orbit, we introduce the unit vectors e r and e v . In this 
oblique co-ordinate system, the co-ordinates of P are 

£=r (jC o 8 ^U T-Vo^sin ^t. 

Elimination of t gives ~ -4 — ; — 1. 

r 0 2 vfin/k 

This is the equation of an ellipse in oblique co-ordinates, the co-ordinate 
axes being in the directions of two conjugate diameters. The general 
orbit of a particle moving under the influence of a quasi-elastic force is 
the ellipse having the initial position vector and the initial vdodty vector 
as conjugate radii, and having its centre at the centre of force. If 
v 0 has the direction of r 0 , the vibration is linear. According to equation 
(25), the frequency of a harmonic vibration is given by 1/2 -tt times 
the square root of the quotient of the elastic constant by the mass: 

-sV! 

Ex. 38. A particle constrained to move on a straight line is subjected to an 
electrostatic force of attraction F 1 = a/a:* and a repulsive force JF S = -f 

P/a: 10 . Calculate the frequency of the vibration resulting from a small displace- 
meat from the position of equilibrium x 0 . 

8. Harmonic Vibrations with Frictional Resistance. 

In addition to the quasi-ela3tic force, let the particle be subjected 
to a frictional force, as is actually the case with vibratory motion in 
practice. Let us assume that th» resisting force is proportional to 
the speed of the particle. A resistance of this kind exists when the 
particle moves at low speeds in the air or in any other fluid. Friction 
between solid bodies is, on the contrary, independent of the speed, 
within wide limits. Whatever the law of dependence of the resisting 
force on the velocity may be, this force is always opposite in direc- 
tion to the velocity of the particle. The total force acting on P is there- 
fore 
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We limit ourselves, in what follows, to rectilinear motion, and call 
the co-ordinate x. Then the equation of motion is 

W S~Mt + * X==0 (27) 

To integrate, we again try 

x = ae^\ 

which leads to the “ characteristic equation ” 
mA 2 ~j- )3A -j- k = 0. 

Its roots are 

K — A+ Jgn and A J£T> 

2m y 4m 2 m 2 m y 4m 2 m 

Two cases must be distinguished here: 

(a) /S 2 > 4&m. In this case only real quantities appear, and the 
general solution for fP > 4fcm is 



In this aperiodic case, corresponding to large damping, the particle 
gradually returns to its equilibrium position, which it reaches at t = oo. 

Even the limiting oase p 2 — 4 bn results in an aperiodic solution. Superficially, 
it would seem that a double root of the characteristic equation would yield only one 
oonstant of integration. However, it is seen that here bta x ‘ is also a solution, so 
that the general solution is 

* « •“ £*(• + *) = + bt ) (29) 


(b) jS 2 < 4 Jem. There are now imaginary exponents: 



Applying the Euler formula, and introducing new constants of in- 
tegration, we obtain the real form 


x - (A sin 


y m 4m 2 /’ 


or 


»-c«-£‘oo 


( 30 ) 
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We again have a harmonic 
vibration whose amplitude, 
however, decreases exponen- 
tially (cf. fig. 5). This damped 
vibration has the following 
important properties: The 
frequency 

= j_ IK7E 

v 2ir V m 4 m a 
is less than 


,1 /* 



that of the undamped vibration. The ratio of two successive maximum 
deflections in the same direction is 

• ' (31) 

The natural logarithm of this ratio is called the logarithmic decrement S. 
If the damping is small, we make no serious error by putting the period 
of the undamped motion in place of the period T. Thus is obtained the 
simple expression 



Note: In mechanics, especially in treating the problem of the pendulum, the 
period is often takeij to be half the quantity here designated by T, It is thus 
taken as the time between an extreme deflection on one sideband the next one 
on the opposite side. Correspondingly, the logarithmic decrement is often referred 
to the half-period. 


9. Forced Vibrations; Resonance. 

In addition to the forces previously taken to operate during a 
harmonic vibration, let us assume that a periodic external force is 
present. This force may have the form 

F a F q cos oh. 

With these three forces — quasi-elastic, frictional, and applied— -the 
equation of motion becomes 

w* ^ + P ^ + fee — F 0 cosoit, . . . (32) 

Introducing the characteristic angular frequency of the free, undamped 
vibration, u> 0 * = kjm, we have 

d?x 0 dx , , F 0 

3e + m af + “> V: “*“*“*■ 


(32') 
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To integrate this equation we use a theorem from the theory of linear 
differential equations, which states: If g(t) is a particular integral of 
this non-homogeneous equation, and if/(f, A, B) is the general integral 
of the corresponding homogeneous equation, 


_ T £*!4 
<fe 2 m ( 


then the sum g(t)+f(t. A, B) is the general integral of the non- 
homogeneous equation. On account of the linearity of the differential 
equation (82') — i.e. the absence of products of the dependent variable 
and its derivatives— the sum y(t) +/(t, A, B) satisfies the equation, 
as may be seen by substitution. Since this solution contains two con- 
stants of integration, it is also the general solution. The integral 
of the homogeneous equation is known from the preceding paragraph, 
and so we need only find a particular integral of the non-homogeneous 
equation. Since the force F a is a periodic function of angular fre- 
quency co, the trial of a periodic function of this kind is suggested. 
But since the first as well as the second derivative of x appears, it is 
evident that a sine or a cosine function alone cannot satisfy the equa- 
tion. We thus try 

x — p cos cot -f- q since! = \/p 2 + cos (cot — - <f>) 


where the coefficients p and q are still undetermined. If we substitute, 
in equation (8 2'), the condition that the coefficients of costef and of 
sin cot must vanish leads to tho following equations for p and q: 




m’ 


~ p-q(W 

From these we find 

mjF 0 (o) 0 2 — co*) 

P m*K*-co 2 )*-M 

vy+?= 


-co*) = 


9 = - 


m 2 (co 0 2 — co a ) 2 -F/Pto 2 ' j 

tan^ = ^=— r-~ — . j 
8 p wi — o> 2 ) j 


( 33 ) 


The general solution for small damping [Case (b), p. 95] ii 
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The first part represents a damped vibration having the period of 
the characteristic free vibration of the damped system, i.e. it cor- 
responds to a transient phenomenon determined by the initial con- 
ditions, which determine 0 and y. This part dies out after some time. 
The second part represents a permanent vibration of the same period 
as the exciting force. This is the steady state. For stronger damping 
[Case (a)] the first part becomes aperiodic, while the form of the second 
part remains unchanged. 

We could have found the particular integral which describes the 
steady state much more simply by using the vector diagram (p. 52) 
which always leads to a result rapidly in problems involving the de- 
termination of the permanent modes, or stationary states, in vibration 
processes. If we write for the exciting force, F a — F 0 e' wt , and take 
the solution to be of the form 

then, by equation (32') (p. 96) we must have 

-f £ im + to 0 Ae' < “ t ~* > = — a'"', 
m / m 





the addition of the complex numbers, 
or vectors, on the left (fig. 6) gives a 
sum of modulus 


/ (**«-«*)»+£ 


from the vector diagram (fig. 6). 
This has to be equal to F 0 jm, i.e. 



■\/TO 8 (ft> 0 2 — -m 2 ) 2 - f /^w 2 


(33") 


The two complex members (vectors) of equation (33') must agree 
direction as well as in magnitude, i.e. 

tan^ — 


m(u) e 2 — oj 8 )’ 


(33'") 


Thus the difference in phase between the exciting force and the forced 
vibration, as well as the amplitude of the latter, both depend upon 
w 0 a — to 2 . The phase difference appfoaches zero for the case of very 
slow vibrations, and increases to w/2 for u> ~ ou Q (resonance; see be- 
low), becoming n for infinitely rapid oscillations. When co is given, the 
amplitude is greatest when u> 0 — <w; i.e. when the frequency of the 
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Fie- 7 

a 2 from equation (33"), it follows 


exciting force coincides with 
the natural frequency of the 
undamped system, not with 
the natural frequency of the 
actual system. In this case we 
speak of resonance * In the 
absence of all damping the 
amplitude becomes infinite at 
resonance. To show how the 
amplitude depends on the fre- 
quency of the exciting force, 
i.e. to draw the so-called 
resonance curve, it is con- 
venient to choose the square 
of the amplitude as ordinate. 
If we use the value for 


o 2 — 


El 

™ 2 K 2 - CO 2 ) 2 + 



Introducing the “ discord ” 10 — w 0 as independent variable x and 
putting, as an approximation, 

-j - — 2a>, a> 0 * — w a — — 2a>x 

in the neighbourhood of the resonance point, we obtain this simple 
equation for the resonance curve (fig. 7): 


P 

q 2 m „ T 4 mPx 2 -f- 


For 


_q*__ 1 = jS 2 
o 2 m « 2 4m 2 x 2 + p? 


or 


0 ) — - 


2^ == f == fe 0 (°f- equebion f3i)). 


• (35) 


We therefore have the theorem: Half the interval between the two fre- 
quencies for which the square of the amplitude has half its maximum 
value, these frequencies being measured in terms of the free undamped 

* Strictly, this is the velocity resonance (cf. Ex. 39). 



100 MECHANICS [Chap. 

frequency as unit, is approximately equal to the logarithmic decrement 
divided by 2m. That is, the weaker the damping of the system, the 
sharper the resonance. If we introduce the same simplification into 
(33'"), we see that the phase differences at the positions of half-value 
are tt/4 and Srrji respectively. 

Ex. 39. If the equation of the resonance curve is calculated without resorting 
to the approximations made in the text, it is found that the maximum of the 
square of the- amplitude does not occur at the natural frequency of the damped 
motion, nor at that of the undamped vibration. It is true, however, that the 
particle has its maximum energy (defined as the kinetic energy while passing 
through the zero- point) at <o = The maximum of the time average of the 
energy comes at yet another place. Compute the positions of the maxima of the 
specified quantities. * 


10. Non-harmonic Vibrations; Sudden Changes of Amplitude. 

Rigorously quasi -elastic forces, which are exactly proportional to 
the amplitude, even when it is very large, do net occur in nature. 
The actual binding forces are to be represented by power series in 
powers of the displacement. In most cases, a single additional term, 
along with the first power, is sufficient. Now there are two essentially 
different kinds of possible laws of force. If the force is symmetric, i.e. of 
equal magnitude at corresponding points on both sides of the position of 
equilibrium, or rest-point, then only odd powers may occur in the law of 
force; otherwise we have an unsymmetrio law of force, and hence an 
unsymmetric vibration. We restrict ourselves here to the more com- 
mon case of symmetric vibration, and add a term -(-er 3 to the quasi- 
elastic force —lex. We further assume that e is positive, which means 
that the binding becomes looser with increasing amplitude. The 
equation of motion for free vibrations of this system may be integrated 
readily; however, the result is expressible only by means of elliptic 
functions. The important fact is that the natural frequency decreases 
with increasing amplitude. This becomes evident qualitatively if wo 
consider the actual restoring force replaced by a mean quasi-elastic 
force. On account of the form of the law of force, the restoring force 
will certainlv become smaller with increasing amplitude x, and so, 
by equation (26) (p. 94), the natural frequency will also become less. 
The behaviour of such a system under the influence of a periodic 
external force is very peculiar. Here, even with no damping, the ampli- 
tude cannot increase without limit, for if resonance exists for infinitely 
small amplitudes, an increase in amplitude must cause a departure 
from the natural frequency, so that resonance no longer exists for larger 
amplitudes. The amplitude in the stationary state may be calculated 
by the following approximate method. The differential equation is 
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m — 4- kx — ex 3 — F 0 cos cot. . . , (36) 

The stationary solution is, at any rate, a periodic function of the time 
having a fundamental frequency co/2-it, and is thus representable by a 
Fourier series in multiples of u>. As a first approximation, we begin 
with the fundamental frequency and set 
x 1 — a cos cot. 

The amplitude a i6 still to be determined. If we substitute this 
expression for x in equation (36), and make use of the trigonometrical 
formula 

COfdcot = l cosScot ~t~ f cos cel!, 
we obtain the equation 

(—maojP -f- Ao — | ear) cos cut — - o 3 cos Scot — F 0 cos cot. 

If the fundamental vibration is to satisfy equation (36), we must have 

|i a!+( ^_ tao>+ J=0. . . . (37) 

where we have put co 0 for Vkjm, the natural frequency at infinitesimal 
amplitude. The determination of the roots of this equation will be 
discussed below. At this point we wish to show how the next approxi- 
mations are obtained. If equation (36) is solved for cPx/tft 2 , and if 
the first approximation is substituted in the right member, there 
results 

c Px F„ , Tea , , ea* ea 3 

= — coscui cos cot + 4 — cos cot 4 - - — cos Scot. 

dt* m m m 4 m 

By (37), this is equal to 

— to 2 a cos cot -f- cos 3 cot. 

4 m 


Integration gives 

x 2 = o coscei — 


a 3 

36«ia) 2 


cos Scot 


= a cos cot — cosScot (38) 

36 ho, 2 

This second approximation may then be substituted in equation (36). 
In this manner, any number of terms of the Fourier series may be 
calculated. The convergence of this process requires a separate 
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investigation. It may be said here, however, that the series converges 
for e small. 

It remains to discuss the solution of the cubic equation for a . We 



obtain (fig. 8) a graphically in a y, a co-ordinate system as abscissa of 
the intersection of the cubical parabola 
3eo 3 



with a straight line 



If we start with a> large, the slope of the straight line is negative, and 
there is only one point of intersection P Q . There is also but one inter- 
section for o> = oj 0 . If, now, « 
decreases, the straight line assumes 
a direction such that there are 
three points of intersection P lt 
P t , P 3 > corresponding to three 
possible amplitudes. The reso- 
nance curve has the distorted form 
shown in fig. 9. A more thorough 
analysis shows that if we approach 
from the low-frequency side, the 
amplitude corresponding to the lower branch is the stable one. As a> 
increases we arrive at the limiting point G; from this point onward, only 
the upper branch yields a real point of intersection in fig. 9. Thus, with 
continuous increase of cd, the amplitude will jump suddenly from the 
lower branch to the upper, at or before Q. Such discontinuities (jumps) 
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in amplitude, attributable to the non-harmonic nature of the restoring 
force, are frequently observed in vibration processes, both mechanical 
and electrical. 

Ex. 40. Treat in similar manner the nnsymmetrie non-harmonio vibration 
having V — — kx -f- ex 2 as the law of force, and determine the mean value of 
the displacement. 

11. Mechanics of a Constrained Particle. The Simple Pendulum. 

The motion of a particle is often limited by geometric conditions. 
If, for example, we consider a small mass hanging by a thread of length 
l, its motion is restrained to the extent that it cannot move beyond 
the surface of a sphere of radius l. The whole interior of the sphere, 
however, is accessible in this case. The geometric condition would be 
represented by the inequality I r | l. But the commonest case is 
one in which the motion is confined to a surface or to a curve (i.e. the 
intersection of two surfaces). The condition is thus given by one or 
by two equations. In the last analysis, however, the physical attain- 
ment of these geometric conditions is realized by means of “ forces ”. 
In the given example, these foroeS are the molecular forces between 
the particles of the thread and those at the junction of thread and 
weight. However, we need not concern ourselves with the atomic 
interpretation of these constraining forces. It is sufficient to replace 
the geometric condition by an equivalent constraining force in order 
to reduce the mechanics of a constrained particle to the case of the 
free particle. 

An essential characteristic of all constraining forces is that they 
are perpendicular to the surface to which the motion is confined by 
these forces. If this were not the case, the constraints alone (i.e. the 
mere existence of a surface upon which the particle rests) could set it 
in motion without the action of externally impressed forces. This 
is contrary to everyday experience. For example, this would mean 
that a sphere resting on a smooth horizontal plane could set itself in 
motion. It is true that a tangential force may exist if ( the surface is 
rough, i.e. if there is friction between the particle and the surface; 
but this frictional force is inherently not a constraining force. If the 
magnitude of the constraining force is Z, and if n is the unit normal 
vector to the surface, the equations of motion become 
(P-ir 

m d W-*+ Zn ( 39 > 

If the equation of the surface is G(x, y, z) — 0, then the gradient of 
the function G also has the direction of the Dormal. We may therefore 
write the constraining force as v 

Z — A gradC?, 


(40) 
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where A is a scalar quantity, as yet undetermined, and depending 
upon the co-ordinates. Thus the equation of motion is 

(72 y 

m-^ = P+ AgradG, (41) 

or, in components, 

m£^F w +X~, my ~ F,+ a|~, ml = F, + A (41') 

The quantity A has entered as a new unknown. But since the equation 
of the surface <? = 0 must also be satisfied, there are as many equations 
as there are unknowns. The same is true for the case of a particle con- 
strained to move on a curve which is the intersection of two surfaces 
ff 1 = 6 and G t — 0. The system of equations then consists of the 
equation of motion 

mr — F + A x grad 6?, + Aa grade's,, . . (41") 

containing the two unknown quantities Aj and Aj, along with the 
equations of the two surfaces. 

Since the motion is confined to the surface at all times, every 
element of path dr is perpendicular to the constraining force. For 
thiB reason no work is done by the constraints, and they do not con- 
tribute to the energy balance. In particular, if we are dealing with 
conservative forces, the equation of energy has 'the form derived for 
the case of the free particle: 

tf=<7. (42) 

If a given problem involves but two co-ordinates— the case of motion 
in a plane — which are further connected by the equation of a curve to 
which the motion of the particle is confined, then the energy principle 
itself is sufficient for the integration; for one co-ordinate may be 
expressed in terms of the other by using the equation of the curve and 
the resulting derivatives, giving a differential equation in a single 
variable. 

An instance of this kind is furnished by the simple, or plane mathe- 
matical pendulum. We understand this to mean a particle m fastened 
to one end of a weightless thread of length l. The thread restricts the 
motion of the particle to the inside of a sphere. We confine the dis- 
cussion to plane motion, which may be realized, for example, by 
drawing the pendulum aside a given distance and then releasing it 
from rest. The plane of the motion is then determined by the initial 
position and equilibrium position (vertical) of the thread. The path 
still remains plane if the particle is given an initial velocity along the 
tangent to the circle cut from the sphere by the above plane. 

On the other hand, the orbit is a space curve if the particle receives 
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an impulse the line of which does not lie in the plane. The conser- 
vation of energy alone is not sufficient for the solution of the motion 
of & (space pendulum; an additional first integral, of the equations of 
motion is needed. In this instance, the generalized Law of Areas 
(cl. Ex. S6, p. 89) will serve. This case will not be treated further here. 

Returning to the discussion of the plane pendulum, the conser- 
vation of energy may be stated — using polar co-ordinates* in which 
the constraint is expressed simply by r == l, 

1 4- mgl(l — cos <f>) = C, . . . . (43) 

the potential of gravity being given by mgz. As long as we rule out the 
case where the pendulum turns completely over, there are always two 
points at which the motion reverses, and where <t> becomes zero. If 
we call the maximum amplitude, corresponding to such a reversal 
point, the angle a, then, since (43} gives for this point 

mgl( 1 — cos a) — G, 


and the energy equation becomes 


(cos a — cos <f>) = 0. 

.... (44) 

The integration is not difficult for small amplitud 
the cosines in series, and drop ail terms beyond the 

ea. If we develop 
squares, we obtain 

£-Vf (“*-«■ 

from which 

it - /** 

V — ff> 2 V l 

.... (45) 

Integration giires 


da- 1 (|) == \/| t 4- & or <j>~ a 

: -r A . (46) 


The constant of integration h may be made zero by specifying that 
; -0 when — 0, i.e. when the particle passes through the position 
of rest. We thus have a simple harmonic vibration of frequency 

-2vVf <47) 


The period is thus 


( 47 ') 
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The actual equation of motion may be obtained by differentiating 
equation (43) with respect to t\ this equation might also have been 
derived directly. It is 

^ + |sin^ = 0 (48) 

The first approximation putting sin <f>= <j> gives the differential equa- 
tion of simple harmonic motion. If the sine is developed to one addi- 
tional term, we obtain an equation exactly like that of non-harmonic 
vibration, which was treated at length in § 10 (p. 100),* 

^ + ( 49 ) 

The pendulum, then, represents, for moderate amplitudes, a simple case 
of non-harmonic vibration. The rigorous integration for arbitrarily 
large amplitudes leads to elliptic functions. Thus, if we start with 
the energy equation (44) (p. 105) and replace coa</> by 1—2 sin 2 (<f>/2) 
and cos a by 1 — 2 sin* (a/2), we obtain, after separation of the 
variables. 



This is the exact equation corresponding to (45) (p. 105). 

The integral is elliptic, and cannot be expressed in terms of elementary func- 
tions; it is, however, readily brought into the normal form of an elliptic integral 
of the first kind. If we put 

sin | = sin | sinu, (51) 

we obtain 



Integrating between < = 0 and t=t, with the initial condition that for t «= 0, 
9=0 and therefore u = 0, 



This integral is evaluated in many tables, where it is given in terms of u and 
a/2. See, for example, E. Jahnke and P. Emde, Tables of Functions (New York, 
Dover Publications, 1943). The graphs on p. 61 of this book show vividly the 
departures from the limiting case of a = 0. . 

* There we also took into account the presenoe of a periodio external driving 
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The inverse of this integral is the Jacobian Function 



The function sn (sine amplitude) is given in tables, with the help of which the 
oourse of the solution of (53) may be traced. 

We inquire, finally, the magnitude of the restraining force which holds the 
particle in its ciroular path. Aooording to Newton’s Third Law concerning the 
equality of action and reaction, this foroe is equal to that with which the partiole 
pulls on the cord. Since it is normal to the given surface, it must be radial. In 
the neighbourhood of the equilibrium point, the constraining force is oertainly 
directed towards the oentre. 

The magnitude is readily found by writing the equation of motion obtained 
by resolving along the normal. Taking, as usual, the direction toward the centre 
of curvature as the positive direction of normal acceleration, we have 

ml = = —mg cos 9 + Z. 

Putting for 9 * its value from the energy equation (44) (p. 105), we obtain 

Z = mg( 3 cosip — 2 cos a) 154) 

The constraining force Z changes its sign when 

3 oosip m = 2 cos a. 

This equation has a root which is compatible with the condition that q> < a 
only when a > tt/ 2. This means that as long as a does not exceed rc/2, the thread 
remains taut. Arbitrarily large amplitudes are, however, not possible with a 
pendulum consisting of a particle suspended by a thread,~for there is a point 
(viz. where tp -= <p m ) where the pendulum collapses. If we take, for example, 
an amplitude a= it (this means that the particle just reaches the top of its 
circle) we have 

eos<p m = — f, ip^ = 131° 49'. 

The reason for this peculiarity is that a thread does not secure the geometrio 
condition r — l completely; it excludes points for which r > l, but not points 
for which r < l. A satisfactory mechanism is provided by replacing the thread 
by a rigid, weightless rod. 

Ex. 41. F'W what curve is the period of plane pendulum motion independent 
of the amplitude? [Introduce the length of arc s as a function of z = l (1 — cos 9 )]. 

Ex. 42. A motor-cyclist is able to take a curve of 10 m. radius on a flat, 
unbanked road at 20 km ./hr. without skidding. With what speed must he ride 
on the vertical inside surface of a ciroular cylinder of 5 m. radius, made of the 
same material, in order that he be able to ride in a horizontal path^without 
slipping down? Recall that the traction force due to friotion is proportional to 
the normal pressure and independent of the speed. 
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General Theorems on the Mechanics of Systems 

of Particles 

We represent a physical system schematically by means of a num- 
ber of particles and apply the results of Chapter V to the individual 
particles. From these results we can. deduce laws referring to the 
behaviour oi the system as a whole. The range of validity of these 
laws is almost unlimited, since we can, in the last analysis, consider 
any body to be composed of a system of enormously many particles — 
electrons and protons. The forces acting upon a single particle are of 
two kinds: either they originate in the other particles of the system— 
internal forces — or they have their origin outside the system — external 
forces. Newton's Third Law of the equality of action and reaction is of 
especial importance here. We understand this law to have the follow- 
ing meaning: If a particle exerts on P & an attractive force directed 
toward P l5 then it is at once evident that 1\ attracts P x toward P 2 
with the same force, i.e. the force vectors of the two internal forces 
resulting from the mutual attraction are equal and opposite. 

1 . Theorem concerning the Motion of the oeuvre of Mass. 

We start with a system of N particles. The &th particle is subject 
to the following forces: A number of external forces which we replace 
by their resultant F*; further, the force F ii: due to the presence of the 
first particle, F 2 k from the second, ike. — in general, P a from the ttb 
particle. The equation of motion for the fefch particle is thus 

• • • • • CD 

There are N such equations, one for each particle Imagine them all 
written down and added together: 

.Viy 

= + .... ( 2 ) 

Since internal forces havmg both subscripts alike do not exist, accord- 
ing to our notation, the combinations k — i are to be excluded, from 
the double sum. Now for every force — the force exerted by the 
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jth particle on the Ath— there corresponds a force F w — that exerted 
by the &th particle on the jth — and these two forces are equal and 
opposite. Hence the double sum vanishes, and the internal forces of 
the system cancel out in the summation. There remains in the right 
member only the vector sum of the external forces acting on the indi- 
vidual particles. We now define the centre of mass of a system to be a 
point whose radius vector r (referred to an arbitrary centre) multiplied 
by the total mass of the system is equal to the vector sum of the pro- 
ducts of individual radius vectors of the separate particles with the 
corresponding masses: 

Mr — T. k m k r k (3) 


If we substitute this expression in equation (2), we have the theorem 


(4) 


The centre of mass of a system moves as if the entire mass of the system 
were concentrated C'jtre, with the resultant of the externally applied forces 
acting at that point. In particular, if there are no external forces, the 
centre of mass remains at rest or in a state of uniform rectilinear 
motion. As is well known, this theorem is the basis -of the explanation 
of recoil phenomena. For example, if a shot is fired from a cannon 
standing upon a smooth horizontal plane, then the gun must spring 
back with a velocity such that the common centre of mass of cannon 
and projectile remains in the vertical line through the point of firing; 
for, neglecting friction of the gun with the ground, the only external 
force is gravity, which has no horizontal component. 

Since the most universal external force is that of gravity, the centre of mass 
is commonly referred to as the centre of gravity. Another name for it in equally 
general use is the centre of inertia. The following elementary considerations are 
useful in determining this point: If r is the radius vector of the centre of gravity 
of two particles m t and ra 2 , then 

(nij + m t ) r - to, r x + m 2 r 2 , 
or mi(? — rf) = m t (r t — r); 

this means that the vectors r — r t and r 2 — r are parallel. But since they have 
the terminus of r in common, the three points m,, m t and the centre of gravity 
are eoUinear. The position of the centre of gravity is determined by 
| r — r, | m t 

We thus have the rule: The centre of gravity of two particles m, and m s divides the 
distance between the particles in the ratio of the two mosses, the centre of gravity 
being nearer the larger mass. If, now, a third particle be added to the system, 
the centre of gravity of the set will be the centre of gravity of m 8 and the original 
centre of gravity, where both m, and m t may be considered concentrated. It is 
readily seen that the centre of gravity, found in this way, is independent o# 
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the order in whioli the particles are taken. The procedure is simitar for additional 

pB-ltriciOB. 

Ex. 43. Two particle® m, and m, as* ire equilibrium when at a distance a 
Apa<rv. If fcfce distance be altered a small amount a, there arise restoring forces 
pi'opnrtionftl to x. C&lcukta the characteristic frequencies of the system. 


'£. Aaguiar Momentum of a System of Particles. 

AO'toidii'g to § S, p. 88, the angular momentum (or moment of 
momentum) ff —■ kj [rr ] of a particle is equal to 2tn times the “ &real ” 
velocity of the radius vector. The vector eum Hm k [r k r k ], called the 
toted angular momentum (or moment of momentum ) of a system of par- 
ticles, is one of the most important quantities in mechanics. We shall 
now investigate the properties of this quantity. For this purpose we 
multiply the equations of motion for a particle vectorially by r k and 
sum over all the particles. The result is 

[n- 5*] , . m 

As shown at p. 89, the left member represente the time derivative of 
the quantity that is, of the total angular momentum. 

Further, the vector product of the radius vector to the point of appli- 
cation, of a force by the force vector is called the moment of the force 
F s . We denote it by K*. The magnitude of M* corresponds to the 
product of force by lover arm, in relation to burning about 0. The 
total moment of the external forces is gives, by £[r*FjJ. The second 
tern in the right member of (6), which represents the resultant of the 
moments of the internal forces, vanishes if the internal forces between 
two particles have the direction of the line joining the particles, he. if 
the forces are central Thus, since W jk = — F w , we have for any pair 
of particles 

[r^«]+[r,F«J-[(r & -r,)F,*}. . . , (6) 

But the vector product on the right vanishes, since we arc assuming 
that is in the direction of r* — Ste. Thera remains, therefore, 

A ,•? p» 

^-2tf ***.]•- * > * <7) 

For a system of particles in- which the forces between any two particles 
are in the direction of the line joining these particles, the rate of change 
of the total angular momentum is equal to the sum of the moments of (hi 
applied forces. 

The limitation made above is actually of iitfie importance. .Float considera- 
tions of symmetry, it is difficult to imagine a fore® aoting between two points 
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fcicn, it mast bo remembered that this law deala with the force between a magnet* 
pole and an elementary segment (i.e. not a point or particle) of an electrical con- 
ductor. 

In the particular case in which there are no external forces acting, 
or if the total moment of the forces vanishes, then, according to equation 
(7), the total angular momentum of the system remains constant. In 
this form the theorem explains a large variety of phenomena of every- 
day life, e g. the method by which a child sets a swing in motion, the 
ability of a falling cat to right itself before landing, the familiar turn- 
table experiments, &c. This law finds one of its most beautiful appli- 
cations in explaining the Einstein-de Haas Effect in. Magnetism (cf. 
p. 464). 

In general, the value of the total angular momentum depends upon the choice 
of the reference point O, If. however, the centre cf gravity of the system ic at 
rest, this quantity becomes independent of the choice of 0. If we denote the 
radius vector to a new centre O' by r B and a radius vector emanating from O' 
by r k ', then 

r y =■- r 9 ~ r k '. 


The angular momentum, referred to 


and referred to O’, 


P = 
V = 


0, is 

£,v"ii>a>s]. 


If we put r t -|- r k ‘ for r k in F, we have, on account of r t ** 0 a 

F - Ej^Kr, -5- r k ’)r k '} - -<- K 

But the first term vanishes if 

2 %?= 0 , 
dt 

i.e. if the centre of gravity is at rest. 


3. Total Energy of & System cf Particles. 

Let us multiply the equation of motion of the Mh particle soalarly 
with dr jit, and sum over all the particles. Then 


v &r k dr k a._ idryV 
S * w * W ~dt = di ¥Kmh \~u) “ 


dr k 
* <U ' 


(8) 


Integrating between the times i r . and t 



The left member represents the total change in kinetic energy of the 
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system, the right member gives the work done by the internal and 
external forces. But it is by no means the case that the work done 
by the internal forces cancels out in calculating the energy, as one 
might expect it to do in view of the results of the two preceding 
sections. 

The kinetic energy may be divided into two parts, each of which 
has a physical meaning. If we introduce a second co-ordinate system, 
whose origin O' is at the centre of gravity of the systdfea, and if we 
denote all radius vectors referred to this system by primes, we have 

r*=r+ rf. 

Then, identically, 

- <§)■*.“* + f M 

The second summand on the right vanishes, however, since Hm k r k jM 
is, by equation (3) (p. 109), the radius vector of the centre of gravity — 
and this, by hypothesis, is zero in the primed co-ordinates. The first 
term on the right represents the kinetic energy of the system, con- 
sidering the entire mass to be concentrated at tue centre of gravity. 
The lasfc.fcerm gives the kinetic energy of motion of the system referred 
to the centre of gravity, when considered as at rest. Thus, we may say: 
Tke total kinetic energy is equal to the translational kinetic energy of the 
entire mass, considered concentrated at the centre of gravity, plus tke 
energy of motion of the parts of the system relative to the centre of gravity. 

We further assume that the internal forces are such that they are 
derivable from a potential. The potential of the force operating 
between the points j and k is a function of the distance between the 
two points, and therefore of their co-ordinates: 

U lt = U ik (r ik ) = U ik (V (x, - xff 4- (y, — y k f + (z, -^p). (11) 


The force acting on k is obtained by taking j to be fixed, and con- 
sidering k to move in the potential field given by the point function 
U ik ; i.e. we consider the co-ordinates of j to be fixed, those of k to be 
variable. Then, 


F,* = 


.dv jk , dV jk ,du jk _ 

1 8x k ^ dy k 3 z k 


— grad* TJj 


(12) 


in like mannei-, 

.dU }k 


F w = - 


J dy, dz, 


grad, O jk 


-F*. (13) 
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The work done in causing Small displacements of j and k is 
F ' ik dr k + F k ,dr, = — (°^J*dx k 


, , 317*, dv ik 

dy k +~d* k + 1 ^ t 


+t: iy ’ +d -k il )'’- dv “ 


The negative of the sum of F jk dr k and-F kJ dr t is therefore obtained by 
forming the total differential of U lk , defined as a function of the six 
co-ordinates of the two points, in (11). If, then, we wish to introduce 
the internal potential into the right member of equation (9) (p. Ill ), 
we must write. 

. (15) 

It is readily seen that the factor 1/2 enters: If we start with point 
1, and calculate the mutual energy TJ ik between this and all the other 
points, k runs from 2 to N; but when we take point 2, we must start 
counting with 3, since the mutual efiect of points 1 and 2 was already 
taken into account in dealing with point 1, and so on. Thus, in ex- 
tending the summation over all combinations j and k, we must divide 
by two. 

If the external forces have also a potential, the energy equation (9) 
becomes 

T + X k U le + $Z k Z i U jk 

= r<°> + S* U k w + p)*S, ty°> = const, (j 4= k), (16) 

where T denotes the kinetic energy. The sum of the kinetic energy 
and of the external and internal potential energy of a system is constant, 
if the external as well as the internal forces are conservative. 

Ex. 44. A solid sphere of radius r and density p rolls without slipping down 
a plane inclined at an angle a to the horizontal. What is the velocity of its oentre 
after rolling a distance s, and how does this velocity compare with that which 
‘would he attained by the sphere in sliding without friction the same distance? 

Ex. 4S. Give the theory of the central impact of two spheres, masses m l and 
m t , whose velocities are along the line joining their centres: (a) for the case of 
perfectly elastic spheres, where no mechanics! energy is lost in the impact (i.e. 
converted into heat); (6) for a case of inelastic impact in which mj acquires the 
internal energy e. 


4. The Principle of Virtual Displacements, d’Alembert’s Principle and 
the Lagrangian Equations of the First Kind. 

Let any number of forces be applied to the feth particle of a system. 
Assume that the individual particles are completely free, i.e. there are 
no constraints imposed by geometric conditions. Then the condition 
that the system is not to move under the influence of the forces is 
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evidently the condition for equilibrium , viz. that the resultant W k is 
to vanish: 


If we imagine each particle displaced from its rest position by a 
vector 5 r k , then T.F k Sr k represents the total work done in such 
a virtual displacement. Since each F k vanishes, this sum is evidently 
zero. If a system of free particles is in equilibrium, the total work done 
by all farces, internal and external, m a small virtual displacement is zero. 
If the forces have a potential XJ — EH*, then 

-JESJ7* = -W = 0. . . . (17) 

The vanishing of the variation W means, however, that the function 
has a stationary value. In a state of equilibrium the potential energy of 
a system of free particles has a stationary calm. 

At first sight these considerations seem trivial; their utility first 
appears when limitations are imposed in the form of geometric con- 
ditions; e.g. the distance between two particles is to remain constant, 
or certain particles are to be confined to given surfaces, &e. The con- 
straining forces, as was shown in Chap. V (p. 104), can do no work. 
Consequently the principle of virtual displacements holds also for the 
equilibrium of a system with constraints, provided we understand that 
the F k are the actual given forces. On the other hand, it is no longer 
true that F* must be zero; on the contrary, an applied force may be 
balanced by a constraining force. For example, the force of gravity 
"acting on a mass which rests upon a horizontal supporting surface is 
balanced by the reaction of the surface. This difference between a 
free and a constrained system is expressed mathematically by the 
fact that the displacements Zr k are no longer independent, but are 
such that certain auxiliary conditions are satisfied. If we write equation 
(17) in terms of the co-ordinates, which we designate by consecutive 
numbers instead of by x, y, z, then the sum 

iM’.-Szi » o (17') 


has 3 N terms, for a system of N particles. Each equation of condition 
reduces the number of 3N arbitrary variables fey one. For example, 
if one equation of condition exists, we can express the last displace- 
ment in terns of the others, and substitute in equation (17), which 
will then contain only independent displacements. However, the 
following procedure, devised by Lagrange, affords a much better 
general view of the process: let the conditions be given in the form 
of finite equations connecting the co-ordinates x t { hohnomous systems), 
which are independent of the time ( sderonomous systems). 

Let there be l equations of condition. The Icth of these, 

*8, ... aw) = 0, 
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gives upon differentiating, 

dfj. 


dxt 


8x * + • * • frr 8 *" 


a*a 


3/* d 

3^; 


(18) 


If we multiply these equations by certain provisionally undetermined 
multipliers A fc , aDd add them to equation (17'), we have 




(f, + A,g+A a |+...A,g i )&,-°. . (19) 

In this sum of 3 N terms we can Belect the multipliers in such a way that 
the last l terms vanish. The remaining (3JV — l) terms then contain 
only (BN — 1} displacements, i.e. as many as there are independent 
quantities Bx ( present. However, since these 8a;< are independent of 
each other, their coefficients must vanish separately, in order that 
the sum be zero. For we may select the purely arbitrary displace- 
ments in such a way that, for example, all the Sx t except Sa?„ are 
zero, in which case the only remaining member of the sum is 


of equations 





r hypothesis, Sx h is to be different from zero, the 
brackets must vanish. We thus obtain the system 


3aq 


*■+*■§!+•• 

.»,36-o 

3*2 

, ... (20) 


■ A %_0 

0®ts j 



Despite the fact that these equations are all of the same form, they 
have different origins. I of these equations hold because we purposely 
determine the l quantities A k in such way as to make them true; the 
remaining (3 N — I) equations are true, however, because the deter- 
mination of the A*. in this way makes the (3 N — l) displacements 
independent of one another. Besides these 3 N equations, we have 
also the l equations of condition, so that the number of equations 
is equal to (31V + l), which is the same as the number of unknowns, 
viz. the 3 N co-ordinates x ( of the particles, and the l multipliers A<. 
On the other hand, if we take the constraining forces Z t which em- 
body the geometric conditions, instead of the conditions themselves, 
we then have a free system for which, from above, 

F t + Z t = 0 . ....... 


( 21 ) 
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Comparison with equation (20) shows that the constraining forces are 
represented by 


2 < = a '§£ + ^ + "^ 




dx/ 


Thus far we have been occupied only with statics, or equilibrium, 
but dynamics can be treated in a quite similar way. Denoting the 
resultant of the forces applied to the /cth particle by F*, that of the 
constraint forces by 25* , we have 

, 72 *. 


If a virtual displacement is again performed, then the constraints by 
themselves can do no work, and we must have 2 t Z k 8 r k — 0. We 
thus obtain the relation known as d’Alembert's Principle: 

= ° (23) 

Since .the equations of condition limit the freedom of the Sr !c , we 
cannot immediately conclude from (23) that the individual terms of 
the sum vanish. We must, in fact, proceed exactly as in statics. Again 
we labej the co-ordinates consecutively, remembering that each mass 
occurs three times, i.e. m* n = m in _ x — m 3n _ 2 . The relation (23) 
then becomes 

2^ — m { x,) hx ( — 0 (24) 


Let the l equations of condition again be finite equations. Differentia- 
tion gives 




Bxj + 


bx z + 


M 

3*3 x 




= 0 . 


We again multiply each of these equations by a multiplier X,. (for 
the present undetermined), and add them to equation (24), whence 
we have 

2, (f, - +*£+*£+...». If) **-*■ <»> 

Exactly as in the statical case, we can dispose of the l quantities X k 
so that the first l parentheses vanish. The (3 N — l) remaining paren- 
theses contain as many displacements as there are independent vari- 
ables present. Since we may choose these hx ( at will, the remaining 
parentheses must also vanish separately. We then obtain a 
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system of equations known as Lagrange’s Equations of the First Kind: 


« h x 1 ~F l + + + • . - 

9 *i 

» a -f,+ a 1 |A + a 1 |£*4... 

,'x^ 

OXf 

n hs * 32 f — F sir + A x + A 2 -j- 

..’.A, 1-4 

ox 3S 


In this case also we easily see that the .terms added to F i in the right 
member are the components of the constraining forces, which must be 
introduced as the forces corresponding to the geometric conditions, 
if we wish to treat the particles as free. The number of equations is 
the same as the number of unknowns. The Lagrangian equations 
contain, besides the 32V co-ordinates of the particles, the l multipliers; 
on the other hand, we have, in addition to the 3 N Lagrange equations, 
the l equations of condition 

F k (X! . , . % v ) = 0 . 

If the integration of the entire system of equations (26) (including 
the l equations of condition) can be performed, we obtain the com- 
ponents of the constraining forces, since the A* are also known. These 
forces of constraint are equal and opposite to the pressures or tensions 
exerted by the particles on the guiding mechanism which secures the 
geometric conditions. Such forces were studied for a single particle in 
the case of the pendulum. A pertinent example, where external forces 
are absent, is that of a particle whirled about at the end of a string 
tied to a fixed point in a horizontal plane, the force of gravity being 
balanced by the reaction of the plane. The centripetal force exerted by 
the string is the equilibrant of the centrifugal force of rotation of the 
particle. 

Another way of looking at the question, convenient in many com- 
putations, is the following: The fundamental equation 



may be considered to express the equilibrium between the applied 
farce F and the inertia force — m(d?r/dt 2 ). In this instance, however, 
Newton’s second law must be formulated to read: 

During the accelerated motion, a force — m{d?r/dP) acts on the mass m 
and there is equilibrium between applied and inertial force. This view 
formally refers dynamics back to statics, and the considerations of 
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statics may be applied to dynamics, so that we arrive again at (23). 
The difference in the two approaches may be illustrated by the ex- 
ample of a spring-type weighing machine, standing on which is a 
person who produces accelerated motion by flexing his knees. With- 
out the introduction of inertial forces we have 

cPz „ 

=* -”* + *, 

where F is the restoring force in the spring. The other view says that 
the force F' exerted by the spring must equal the resultant of the applied 
forces. The latter are the gravitational force —my and the inertial 
force — m(d?zjdF), so that 



According to the third law, F' must equal —F, so that both approaches 
lead to the same result. While the second formulation often yields the 
formal result more quickly, it must be remembered that the first one 
usually takes into account the physical situation more satisfactorily. 


5. The Lagrangian Equations of the Second Kind for Arbitrary Co- 
ordinates (Generalized Co-ordinates). 

The Lagrangian equations of the First Kind refer to rectangular 
co-ordinates. However, it is often more useful to specify the positions 
of the particles by means of other parameters. This almost invariably 
permits us to regard the equations of condition as equivalent to hold- 
ing certain co-ordinates constant, and these co-ordinates then drop 
out of the conditional relationships. Although the introduction of con- 
ditions is also possible with any system of co-ordinates, we consider 
here the normal case where, in place of 3 N rectangular co-ordinates 
and l equations of condition, we have f—ZN—l independent para- 
meters q lt q 2 , ■ ■ ■ qt ■ For example, let all points of the system be 
required to remain on the surface of a sphere. In this case the correct 
co-ordinates are spherical co-ordinates, r, <f>, 6, and the variable quan- 
tities are only the angles <f> and 6. It remains to express the equations 
of motion in these arbitrary co-ordinates. For this purpose we 
imagine the rectangular co-ordinates to be given as functions of the 
independent q k (called the generalized co-ordinates) 

*<=*<(?!, ??. • • •?/)• . . . - . (27) 

Further, we seek to replace the a;*, and x t appearing in the d’Alem- 
bert equation 


Z t (F t - **,,£<) 8 x, = 0, 


(24) 
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by the and their time derivatives. Now 

dx { - . dx t * , d-Xi 

and similarly x< 


Ht 


dxt . dx, . 9 x { . 

dg q ' + dg q * + -‘-d£f 4 '’ 


■ (28) 
• m 


The expressions x ( (dx { jdq k ) which result from substitution of (28) in 
(24), may be transformed as follows^ it is verified at once that 


9 a;, 

C< 3?V 




. d /dx A 
dt\dq k /‘ 


(29) 


Now we can consider the x 4 to be functions of the 2/ variables q k 
and q k * 


Equation (28') then gives 


9 ± t _ dx t 
dq k dq k 


(SO) 


Then the first term in the right member of equation (29) becomes 


d 

dt 


/ . 3®A d/.dxA d /, dx*\ 


. (31) 


The second term may also be transformed in a w&f which leads to 
easily interpreted expressions. We have 


d /9®A ?j % x t . d*Xt . 

dt \dq k ) ~ dq^dql 91 + + " ‘ 

. 

, tf. 

oq.fdq k 


9 /dxt . , dx ( . , N 

“5h(si !,T !5: fc+ -, 

Hi*- 

. (32) 

which X, ~ ~ ($ xf). 


. (33) 

Then, according to equations (28) and (29), 



m { £ f 8x, = £* i*M { a — Sq*. 

(33') 


* The introduction of the q k as new variables, in addition to the qu is apt to be 
misunderstood, but consideration of the following example of a similar kind may 
help to remove the difficulty. The only independent variable is the time, upon which 
the %i and it depend, through the medium of the qt- However, if a function /(«, y') 
is to be differentiated with respect to the independent variable x, where y' stands for 
dyjdx, we use the formula 
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We further assume that the acting forces have potentials U i} which 
we imagine to be given as functions of the q k . Then 


^ 83 ;,= -8 U { : 


( dU i 

= ~W 8sl 


,du is . , du< s \ 


(34) 


dq, ™ r '"dq r 

If we now substitute expressions (33') and (34) in the statement of 
d’Alembert’s principle, there results 


°- (35) 

But S \wifX* represents the total kinetic energy T of the system. Then, 
since the bracketed expressions must vanish, individually, on account 
of the independence of the displacements 8 q k , and since V depends 
only upon the position co-ordinates q k and not upon q k , we obtain 
the following Lagrangian Equations of the Second Kind for conser- 
vative forces; 

d d(T- U) _ 0 (T - U) = 
dt dq k dq k 


= 0 . 


(36) 


The difference between the kinetic and potential energies, which is a 
function of the time (through the position co-ordinates and their time 
derivatives), is called the Lagrangian Function L of the system: 

L—T— U (37) 

Using this notation, equation (36) may be written 
d dL d L 


dt dq k dq k 


= 0. 


(38) 


If, in addition to the conservative forces F it non-conservative forces 
F/ are also acting, then the work done by the latter in a virtual dis- 
p’acement is given by inserting the values of dx ( from equation (28) 
(p. 119)': 

sw - w* - *■>*<’ (H *«> ■ + *». ■ + • • • I; »*) 

The coefficients of the S q k are called the components of the forces 
referred to the generalized co-ordinates q k , and are denoted by F\ k . 
With the admission of non-conservative forces, the Lagrangian equa- 
tions of the second kind thus become 


d dL_ 
dt dq k 


dL = 
dq k 


( 40 ) 
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It is to be remembered that we have treated only the most common case, 
i.e- the holonomous-scleronomous system defined on p. 114. For a discussion of 
the oases in which the equations expressing the x t as functions of the q k 

(а) contain the time itself explicitly (holonomons-rheonomous system), and 

(б) are given in the form of non-integrable differential equations Sx { = 0,(0 
Sg, -f o a ( 0 Sq t + . . . O/W Sq f (non-holonomous system), consult the more com- 
plete textbooks on mechanics. 

Ex. 46. Set up the equations of motion, in spherical co-ordinates, of a particle 
on the suriaoe of the earth. 


6. Generalized Momentum Co-ordinates. Hamilton’s Equations. 

In the case of holonomous-scleronomous systems, the x { are given 
as functions of the q k by finite equations which do not contain the 
time itself, and the kinetic energy T is a homogeneous quadratic 
function of the time derivatives of the q k : 

T = - is»., (ff i + |f|#,+ If;#/)' 

- + fw + .-. • + aPn j,4 +...). . (41) 


Now in rectangular co-ordinates, each component of the momentum 
is given by 

3 T 


p xi = mfli - 


(42) 


In analogy with this, we call the following quantities the generalized 
momentum components: 

Vu = d ~ lf k - = PuS 1 + Pv4t +■■■ Pf*4f> • (43) 

although these quantities may or may not have the dimensions of 
momentum, according to the meaning of the generalized co-ordinates. 
Since the instantaneous state of a system is uniquely characterized 
by giving the position and velocity (momentum) of every point, we 
speak of momentum co-ordinates instead of momentum components, 
as the q k and the q k have already been given equal rank in the La- 
grangian equations of the second kind. 

According to equation (43), the momentum co-ordinates are linear 
functions of the generalized velocity components q k . Since the de- 
terminant. of the system of equations does not vanish (as a more 
thorough analysis shows) the system may be solved for the q k , so that, 
reciprocally, the velocity components q k are also linear functions of 
the generalized momentum co-ordinates p k . If we express the equa- 
tions of motion in terms of the generalized momenta, we obtain a 
new form, due to Hamilton. This form furnishes the best starting-point 
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for many investigations in higher mechanics, especially in celestial 
mechanics, and so has come to be called the canonical form of the 
equations of motion. Introduction of the momentum co-ordinates 
(43) into the Lagrangian equation (36) (p. 120) gives at once for con- 
servative forces, since U is independent of the q k , 

m 

This equation is still unsatisfactory in that the Lagrangian function 
in the right member contains the q k instead of the p k . We could try 
to express the q k in terms of p k by means of the transformation (43), 
but it is simpler to introduce a new function E of q b and p k defined by 
H(pk> ?*) — tepid* — -f>(?i> ik)- • - - - (45) 
The total differential of this function is 


= + 2 k q*dp k - S* d ~ dq k - S* || dq k . . (46) 


Since the potential energy depends only upon the position co-ordinates, 
and not upon those of the velocity, we have 


dL _ 8 T(q k , q k ) 
dq k 3q k 


^pk- 


. . tm 


As a result, the first and the third terms in the right member of equation 
(48) drop out, and comparison of coefficients gives 


4k 


dj{_ 

dp k 


(48) 


3 L 

H*" 


_ d_n 

Hk 


. ( 49 ) 


Substituting these 
obtain 


results in the “ mixed ” equation (44), we 


dp k __ dH 
dt dq k 


(49') 


The canonical form of the equations of motion is thus 

jPk^ _ ZH(p k , q k ) 1 
dt dq k I 

dq h _ 3H(p k , q k ) | 

dt dp k J 


( 50 ) 
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For conservative forces, to which we restrict ourselves, the meaning 
of H is very simple. If we form the total time derivative 



this derivative vanishes, on account of (50). H is thus a constant of 
the system. Using equation (45) (p. 122), it is readily shown that E 
is the same as the total energy E: the kinetic energy T is a homo- 
geneous quadratic function of the q k , according to equation (41) (p. 121). 
According to Euler’s Theorem on homogeneous functions, we there- 
fore have 

S *«*§f = 2 T~% k q k p k , .... (52) 

as may be verified readily in this instance. It then follows from 
equation (45) that 

E(p t , q k ) — 2T - (T — U) = T + U ~ E. (53) 

7. Hamilton’s Principle 

Besides the Lagrangian and Hamiltonian forms of the equations 
of motion of a system in arbitrary co-ordinates, there is a third 
form which is of the greatest importance. This third statement gives 
the equations not as differential equations, but in the form of a 
stationary condition. The idea of expressing a law of nature by spee ; - 
fying that a certain quantity is to have an extreme value in the actual 
process, is as old as scientific thought itself. The advantage lies in 
the simplicity of the formulation and its independence of any assump- 
tions concerning the particular co-ordinate system selected. The 
derivation of su6h an extremal law of mechanics, from what has pre- 
ceded, is not difficult. Comparison of the Lagrangian equations of the 
second kind with the Euler differential equation of the simplest varia- 
tion problem (p. 77) shows that they are identical. In place of the 
independent variable x we now have the time, and the function 
designated by F on p. 75 is the Lagrangian function L. Thus the 
Lagrangian equations of the second kind correspond to an extreme 
value of the integral 

W *=jT Ldl — f*‘ ( T -U)dt. 

The time integral of twice the kinetic energy is called the notion. We 
are thus able to express the equations of motion in this manner: The, 
natural motion of a system ( i.e . the actual motion which takes place accord- 
ing to the laws of mechanics) is characterized by the fact that the time 
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integral of the Lagmngian function, taken between two configurations of 
the system has an extreme value. This is the celebrated Hamiltonian 
Principle. The foregoing derivation oilers no information as to whether 
the value spoken of is a maximum or a minimum, or merely stationary, 
but this question is of no consequence in what follows. In most instances 
IT is a minimum. 


The mathematical expression of the principle is 
SW 8 J^Ldt =8 J*' (T ~U)dl = 6 f‘'(2T - E) dt = 0, (54) 


where E represents the total energy. 

If forces which are not derivable from potentials also 
the Lagrange equations become 


d8L_8L_ r 

dt 8q k 8q k 


0 . 


act, then, by § 8 (p. 120 ) 


(58) 


In this case the Lagrange equations no longer represent the Euler differential 
equations of a variation problem. However, they may be brought into this form 
if it is possible to find a function M of the co-ordinates and their derivatives 
such that 


d 8M _ 8M_ _ r 
dt 8q k 8q k «*' 


In this oase the funotion V defined by 

L'=L— M=T — U — M (57) 


satisfies the differential equation 


d 8V 
dt dq k 


( 68 ) 


and this is the Euler differential equation corresponding to 

S jL'dt =0. ........ (59) 

This oase oocurs, for example, in the mechanics of an electron in a magnetic field. 
The magnetic force (of. p. 428) is given by F m ag = — (e/e)[» H]. Since this force 
is always normal to the path (the tangent to the orbit is given by v) no work is 
done. It is therefore not possible to derive this force from a scalar potential. 
If, however, we put H = curl A, it may be shown that the function M = e/c 
(Av) satisfies (66). Naturally, it is not always possible to find a funotion M, 
i.e. the system of equations (66) may not be integrable. 


8.* Canonical Transformations.* 

It is often necessary to change from one system of variables to 
another. We might proceed by introducing new position co-ordinates 
by means of equations of the form 

* Sections marked with an asterisk may be omitted at a first reading. 
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and then, using the time derivatives of these equations, express the 
kinetic energy as a function of the new position co-ordinates and their 
time derivatives. Finally, with the aid of the equations 



the new momentum co-ordinates could be determined. Hamilton’s 
principle, however, makes it possible to solve the problem much more 
elegantly, and at the same time much more generally. Let us change 
position and momentum co-ordinates simultaneously, and inquire how 
the transformation relationships must be constituted in order that the 
new variables may again have the character of position and momen- 
tum co-ordinates, i.e. that the canonical equations may still be satis- 
fied. For this purpose, it is necessary and sufficient that the new vari- 
ables again satisfy the Hamiltonian Principle. But, in order that this 
be true, it is sufficient that the difference of the integrands of the 
action, expressed in the new and in the old variables respectively, 
should be the total time derivative of a function of the old and new 
co-ordinates; that is, by (45) (p. 122), 

— Hip* ?*) — + 3{p k , q k ) = ^ f(q k , q k ), (60) 

for if this equation is integrated between the limits t 0 and t 1} we obtain 
f — H(p k , q k )]dt — jT [S f k q k -H(p k , q k )]dt 

= [/(?*, ?*)L, ~ [/(?*, ?*)]«,- 

Upon taking the variation of this expression, the right side vanishes, 
since the end-points are fixed (cf. p. 78). Since the variation of the 
first integral is zero, the variation of the second integral must also 
vanish, which means that the new variables are again canonically con- 
jugate position and momentum co-ordinates. By carrying out the 
differentiation in the right member of (60), and comparing coefficients, 
the transformation formula are obtained in the form 

j 

% _ _ Vfab ?*) ( 61 ) 

Vk ~ ~n v~ I 

H(Pk, ?*) = 3(p k , q k ) j 

The q k may be computed in terms of the p k and q k from the equations 
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of the second line in (61). If we then substitute these values of the 
q k in the equations of the first line, we obtain the p k aa functions of 
the f k and q k . These cumbersome calculations are, however, necessary 
only in rare cases in this general form. The arbitrary function /(}*, q t ) 
is called the generator of the transformation. 

Instead of starting with a function of the old and new position 
co-ordinates, we may start with a function of old position co-ordinates 
and new momentum co-ordinates; this leads to relationships of par- 
ticular importance. Let us add the expression 



to the right member of equation (60). This may be done, since the 
above expression vanishes when we take the variation of the time 
integral, just as does the expression dfjdt. We then write 

- H(p k , q k ) -- Hp k q k + H{f k , q k ) 

_ ?*) _ i^jM* /a 21 

~ dt dt * * ' } 


and obtain, by comparison of coefficients. 


j> s = 


Wlhli 

3?) i 

3/(g» V*> 

dp k 


K(P*> ?*) ^ H(f k , q k ) 


9.* Cyclic Variables. The Hamilton- Jacobi Differential Equation. 


If the Hamiltonian function— -which represents the total energy 
if the forces are conservative— does not contain a certain position 
co-ordinate q it then it follows from the first of the Hamiltonian equa- 
tions that 


§P* _ 0 

dt~~ dq t ~ U * 


(64) 


That is, the conjugate momentum co-ordinate is a constant of the 
system. Such a position co-ordinate is called cyclic. The name is due 
to the fact that the polar angle <f> in central motion has this property. 
If all the position co-ordinates are cyclic, it follows from the second 
of Hamilton’s equations, viz. 


is* = d H (Pi> ?2”-- pa 

dt dp k 


lh, • • • V f ), 


. (65) 
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that the dq k ]dt, being given functions of constants of the system, are 
themselves constants, and the position co-ordinates increase linearly 
with time. The integration of the equations of motion may thus be 
performed immediately in the case o r cyclic variables. A very impor- 
tant method for the integration of the equations of motion is based 
on this fact. Instead of starting with Lagrange’s or Hamilton’s 
equations, for example, and trying to integrate them, we can begin by 
trying to find the generator of a canonical transformation which leads to 
only cyclic variables. If we call the momentum co-ordinates, which are 
constant in time, a*, this generator must be a function S of the old 
position co-ordinates and of the new momentum co-ordinates, such 
that 


H(Pk> ik) = H(a k ). 


According to equation (63) we must put 


„ bS(q k , a k ) 


and q k = 


as 

0 a*' 


. (67) 


The function S is, then, determined by the following differential 
equation: if we substitute dS/dq k for p k in the Hamiltonian function, 
we have for conservative systems 

*(!!•«>)=* « 

which is the Hamilton-Jacobi Differential' Equation. Here E is the 
energy of the system, and must thus be considered a constant of the 
system, since the differential equations themselves do not contain 
the value of the energy. We are therefore at liberty to identify E with 
one of the constants a*, e.g. cq. The Hamilton-Jacobi differential 
equation is of the first order, but second degree, since H is a quadratic 
function of the p k \ the complete solution, then, contains / constants 
of integration (if there are / position co-ordinates), one of which may 
be set equal to E. Instead, we could introduce /arbitrary constants; 
but in this case E must be a function of the <q, cq, . . . a r , and may 
no longer be taken as an arbitrary integration constant. Thus, the 
solution is either 


S(q k , S, a* 0 , ... Of) , 0; E = iq, . . . (69) 


or more generally, 


o s ...a / )=0; S-2S(« 1( <*,...«,). (70) 
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In the former case, we see by (65) that all of the dq k fdt with the ex- 
ception of dqjdt are zero, since * E — H. 

There is an intimate connexion between the function S introduced 
here as the generator of a transformation to cyclic variables and the 
action function (Hamilton’s characteristic function). During the 
course of the motion, the a k are constant, and so their differentials 


are zero. This means that the differential of S is given by 
J5 q 

dJS — 2 dq k s= Spjdj's = 'Lp le q k at. . . (71) 
Now by equation (52) (p. 123), 

i.e. dS = 2Tdt ....... (72) 

and S-S 0 = 2f h TcU (73) 

On the other hand, the action function is, by § 7 (p. 123), 

W z=f2Tat-Et (74) 

Therefore, W — S — Et (75) 


Thus S represents the part of the action W which does not contain 
the time implicitly. On account of the relationship (75), S is also 
known as the contracted action function. 


10.* Periodic and Multiply Periodic Systems. Angle Variables; the 
Angle Variables of Keplerian Motion. 

We consider first a system having a single position co-ordinate x, 
e.g. a linear oscillator. The system is said to be periodic with period 
T, if the eauation , M , ... 

’ - x(t -j- nT) — x(t) 


is satisfied when n is any whole nmnher. According to § 9, a cyclic 
variable <f> increases linearly with the time, i.e. 


= yt, t = ? 


For this reason, the co-ordinate x is also a periodic function of the 
cyclic co-ordinate <f>. We then call the cyclic co-ordinate <f> an angle 
co-ordinate w if the period is unity, i.e. when 

x(w + 1) = x(w) (76) 


* It is impossible to avoid the double designation of the energy by E and by H, 
for the following reason: H(p k , g k ) signifies the Hamiltonian funotion of the position 
and momentum oo-ordinat.es. For conservative systems, this is identical with the total 
energy E, which is a constant of the system. On passing to cyclic variables H will de- 
pend also only on quantities constant in time. In this case we may write E in place 
of H. 
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Id. the simple case of a point moving uniformly around a circle, the 
polar angle cj> is a cyclic variable, but the angle variable is the quantity 
w = <j>l2Tr, for in the latter quantity the rectangular co-ordinate x 
is periodic with period unity. The momentum co-ordinate con- 
jugate to an angle co-ordinate is called an. action variable , and is 
designated by J. 


The advantage of using angle and action variables lies in the 
fact that we obtain the frequency of the system immediately by 
differentiation of the Hamiltonian function; for if E is given as 
a function of the action variables (H cannot be a function of 
the conjugate position co-ordinates, since every angle co-ordinate is 
cyclic), then 


dw 

'di 


— i - e * w — v 1 + s - . • • • (? 7 ) 


On the other hand, every periodio function of frequency v is repre- 
sentable, subject to conditions as to continuity which are by no means 
onerous, by a Fourier series of the form 

x = (78) 


The series is a periodic function of the quantity vt, with period unity. 
But w is an angle variable by hypothesis, and so x is also a periodic 
function of yt, with period 1, i.e. y is identical with v. What constant 
of the system must be introduced as a new momentum co-ordinate in 
order to make the conjugate position co-ordinate not only a cyclic 
variable but also an angle variable? At any rate, the generator of the 
transformation must satisfy the Hamilton-Jacobi differential equation, 
and is therefore again denoted by S. If S is known as a function of q 
and J, then we have, as in § 9, 


and 


J) 

3q 


(79) 


On account of the constancy of J, the increase of w during the course 
of the motion is 


dw 


d*S(q, J) 
dqdJ 


dq. 


(80) 


Our requirement that w be an angle variable may be expressed as 
follows: when the system has returned to its former state, w must 
have increased by 1. If we integrate the increase of w over a period, 
we must have 

l -f dw -f^Tj^^hfrqdq = ~jfpd q . (81) 

IE 711) 
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This requirement is satisfied if we introduce the so-called phase 
integral 

J — £pdq ( 82 ) 

as a constant of the system. The symbol j) denotes integration o /er 
a period of the system— -that is, over a complete range of variability 
of the co-ordinate q. 

According to the above considerations, the process of finding the 
angle variables amounts to the following: the Hamiltonian function 
(i.e. for conservative systems, the energy) is given in terms of any 
arbitrary co-ordinates p and q. Replace p in H(p, q) by dS/dq, and 
obtain the Hamilton-Jacobi Differential Equation (68) (p. 127), which 
becomes an ordinary differential equation if there is but one variable. 
If this equation is solved* or at least reduced to a quadrature, form the 
integral 

< 83 > 


over a complete period of q. The integral becomes a function of the 
integration constant E. If the equation is solved for E, we obtain — 
on the one hand— the Hamiltonian function as a function of the action 
variable, and therefore the frequency is given as 


dE dH 
v = U == dJ' 


(84) 


On the other hand, by substituting J in the integral of the Hamilton- 
Jacobi equation, we can complete the transformation to angle variables. 

Let us illustrate tlie process with the simple example of the linear harmonic 
oscillator. We have [cf. Chap. V, § 7, p. 93], 

that is, p=-mx, T = 


The Hamilton-Jacobi equation thus becomes 

id M*- 

or — V2mJS — m 


Since p = dS/dx must re main real, we can fin d the limits of x at once: * must 
extend from V2E/k to z ma2r =■ + V2E/k. As we go from x m i n to x matJ 
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» inareases, go that x, and therefore p and dS/dx, are positive. Cn the return, 
•a decreases, and dS/dx has the negative sign. The complete integral 

J — (j)~tdx £ V2ro.fi? — mkx* dx = Vmi (j) — 3?dx 

thus represents Vmk times the area of a oirole of radius V2JB/L That is, 

.2 E ,-r _ J fk 

J = ~y rtVmk or ® “ gjt Y ju" 

From this result, the frequency is 

_ dH __ 1 r& 
v dJ 2n V ro’ 

as already derived on p. 94 by elementary methods. Indefinite integration of 
dSjdx, and the replacing of E by 

L 

2k 

yields the generator 8 of the transformation to angle variables, and differen- 
tiation with respeot to J gives to. The calculation is not troublesome, since the 
integral 

VhmE — rofer* dx 
is easily evaluated as an indefinite integral. The result is 



The Hamilton-Jacobi method appears muoh more complicated fees® than the 
direct integration, as oarried out in Chap. V, § 7. As a matter of fact, the full 
value of the Hamilton-Jaoobi method first becomes evident in connexion with 
complicated systems in Celestial Mechanics, or Atomic Mechanics, where it 
proves to be an indispensable aid. 

The application of these principles to systems of more than one 
degree of freedom requires development of the concept of multiply 
periodic — or conditionally periodic — systems, which occur when 
there are several degrees of freedom. Let the rectangular co-ordinates 
be periodic functions of the angle variables w k with period 1, i.e. 

®*(«h + 1, ti> a + 1, w 3 + 1, . . .) = x k (w u m> 2 > • • •), (86) 

and generally, for whole numbers 

®*(«h + »i, + n 2 , . . .) = w 2 . . .). . (87) 

Just as Fourier’s series is the most general representation of periodic 
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functions of one variable, so the general analytical expression for 
functions of several variables, having as many periods as there are 
variables, is the multiple Fourier series of the form 

x* = . . . G) nv ( (M«pi+iwii+v»+ •••>. . (88) 

Since the w k increase linearly with the time, we may substitute the 
expressions v,J. + S* for w k in the Fourier series. Taking the factors 
e 2 jn'n*Jt originating in the phase constants into the coefficients, we 
then have 

x* = 2,2^2, . . . D* mj) e 2W(^ 1 «+ flM v,«+n,M+ ...). . (89) 

This multiple Fourier series no longer represents a periodic function 
of the time. It is true that any one factor, e.g. e 2 ’™' 1 ' 1 ' returns to its 
initial value after T l = \jv x sec; but this term is multiplied by other 
factors which certainly do not return to their old values 

since the separate v k are taken to be independent of each other. A 
true periodicity in time can be obtained only by imposing additional 
conditions on the v k , e.g. the condition v 1 — v % = v A = 

For this reason, multiply periodic systems are also called con- 
ditionally periodic. A simple example of a conditionally periodic system 
is a particle performing harmonic vibrations along the x- and y- axes, 
the two vibrations having incommensurable frequencies. As shown 
on p. 60, the resulting orbit is a so-called Lissajous figure which never 
closes, if the ratio of the two frequencies is not a rational number, but 
which, in the course of time, covers all points within a given rectangle. 
All orbits of conditionally periodic motions are of the type of the 
Lissajous figure; a certain portion of space is densely filled, yet the 
orbit is not closed, and thus a true periodicity in time doe3 not exist. 

Conditionally periodic motions occur in general if the Hamilton- 
Jacobi equation (68) (p. 127) can be solved by “separation of the 
variables ” — the only feasible practical method, assuming that the 
system has properties of periodicity at all. By separation of the variables 
we mean the solution of the equation by putting, in those cases where 
it is possible, 

S (q» 2a> 2s • • • 2/) = Si(2i) + £ 2 ( 22 ) + S 3 (q s ) + ... S f (q,), (90) 

,v . dS dS k ■ /r .,. 

so that — ...... (91) 

and the equation may be separated into a number of ordinary differen- 
tial equations of the form 


(92) 
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the energy E becoming a function of the constants of integration: 

E =?= E{ay, 02 , 03 .... a f) (93) 

The search for the angle and action variables proceeds just as in 
the case of one degree of freedom treated above: let the angle and 
action variables which are sought be denoted by w k and J k respec- 
tively, and let S be the generator of the transformation, as a function 
of the old position co-ordinates q k and the new momentum co-ordinates 
J k . 

Imagine all the position co-ordinates q k except q h held constant. 
Since the action variables are constants, the change in the angle 
variables corresponding to a change in q h is 

d h w k = dq k (94) 

Now, since 

- ™(q k , **) 

the above expression becomes 

< 95 > 

We now postulate the separability of the Hamilton- Jacobi equation 
in the co-ordinates q. ; , which we therefore call the separation variables. 
Thus 

£T,f> < 96 > 

The change in v> k corresponding to passage through the complete range 
of variation of q h is 

(97) 


If we 


introduce the phase integrals 




as action variables, then 

A h w k =1 for h — k 
— 0 for h 4 = & 


(98) 

(99) 


But this means that the variable of separation q t is a periodic function 
of w k , with period unity. While it is true that each of the variables q k 
is simply periodic, the periods of the separate q k are different, and so 
the system is multiply periodic. Hence the rectangular co-ordinates, 
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which may depend on the g* in many ways, become multiply periodic 
functions of the w k . The variables w k and J k , introduced in this manner, 
thus fulfil all the conditions imposed above on angle and action 
variables. 

If the energy is expressed in terms of the phase integrals rather 
than the constants of integration a k , the frequencies of the system fol- 
low by differentiation since, according to (65) (p. 126), and the above 
discussion, 

— — 2# 
v *~ <u ~dj; 

Let ua illustrate the method with the example of Kepleriaa motion treated 
on p, ' 90. Since the force is a function only of the distance r from the ©anfcre of 
attraction, three-di men sional polar co-ordinates are suggested. The line element is 


given by 

^ dr* + rW + r* sir, s 9d 9 », , , . . . (100) 

Then Jm(r» + r a a a -f r 8 sin 1 ®^*), . . (10X) 

and the momentum co-ordinates become 

p f «= mr, p$e= ror’9, p# =» mr* sin 5 6 9. . . . (102) 

Expressed in terms of these co-ordinates, the kinctio energy becomes 

T -s('' , + ?* f +KE ' »»’) <“> 

The potential energy for the gravitational force between m and the central mass 
Mis 

U = ...... (104) 


Replacing the p k by dS/dq k in the Hamilton-Jaeobi differential equation, we 
have 

• c»» 

We put 

8 - *V(r) + 5,(8) + S<,( V ), ..... (106) 

and obtain 

' <"*> 

W© can now separate the equation into three equations with and as® as con- 
stants of integration: 

• • <«*> 



( 109 ) 
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<I09 '> 

K [($)"+ tG-!-*- <no > 

p r = = ^2mE + - “I* (1100 


These equations are somewhat more general than (92) and (93), since (109) and 
(110) contain the integration constants of (108) and (109) respectively. The 
subsequent procedure, however, is the same as above. By the second form, the 
integration is reduced to quadratures. In order to calculate the phase integrals 
J k , the limits of the variables must be determined; the second form of the 
equations is adequate for this purpose. Since a* is constant, p*, and therefore 9, 
always have tho same algebraic sign. If the system is conditionally periodic at 
all, then a complete period of 9 is from 0 to 27t, Then 

a# 29 = 2 raty; (Ill) 

The oo-latitude 0 behaves differently. Since pn must remain real and finite, the 
limits are given by 

8in0 max = sin^rr — Omin) = — (112) 

ae 

With increasing 9, the positive sign of the radical is to be taken; for decreasing 
0, the negative sign, as explained in detail for the case of the linear oscillator 
(p. 130). The value of the integral taken from 0 m ui to 0 ma i and back to 0 m ; n is 

Je f yja e * - -£|Ld0 = 2jt (o« - a*), . . . (113) 


as may be calculated readily. Then 

If we substitute this value of 09 in the expression for dS r /dr, we obtain 


J r - f ^2mE + dr. . . . (114) 

If the motion is periodio or conditionally periodic, then, here too, the zeros of 
the expression under the sign of the square root, or radieand, determine a minimum 
and a maximum of r, and these quantities must be positive, since the radius vector 
is always positive. The solution of the quadratic in r, obtained by setting the 
radieand equal to zero, gives two positive values of r only if IS is negative. This 
is in harmony with the result of Chap. V, § 6, p. 92, that the orbit is hyper- 
bolic for E positive — in which case there is no finite extreme value of r — while 
negative total energy leads to the periodic elliptic orbit. With the assumption 
that E is negative, the integral is to be extended from r mln to r max , and baok to 
r m iu with the opposite sign for the radical. The evaluation of the integral by 
elementary means is somewhat cumbersome, but possible; however, the result 
is obtained much more readily by complex integration (of. p. 73), and is 


J r 


Jo - d* + 


2 t vmb 


(116) 
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which gives the energy 


™ _ —Zizhnb* 

~ <yrF/r+j^ * 

E in terns of the action variables: 
E — B(J r , Jt, J+). . . 


The frequencies may be determined at 
the special case where 

dH 

v '“«7 r “ v — 


dH 

8J, 


by different: 



. . ( 116 ) 

• - (H7) 

Here we have 

. . ( 118 ) 



Since we may set up as many phase integrals as there are co-ordinates, and 
since the energy, in general, depends upon these integrals individually, one 
should expect as many independent frequencies as there are degrees of freedom 
'--in the present case, three. If the number of independent frequencies is smaller, 
the system is said to be degenerate. The difference between the number of degrees 
of freedom and the number of frequencies is called the degree of degeneracy. The 
Kepterian motion is thus doubly degenerate. In this very frequently occurring 
case of degeneracy, the quantities J r , Je, J$ are not yet the correct action vari- 
ables. If we remove the degeneracy by introducing additional small forces 
(“ perturbations ”), wa most insist that some of the angle variables of the result- 
ing non-degenerate system pass over into constants of the degenerate system 
when the perturbing forces are reduced to zero. This is not the case, however, 
for the angle variables which correspond to J T , Je, J^. The true angle variables 
may be obtained by starting with a non-degenerate system and allowing the 
perturbing forces to vanish. These variables, which are known in astronomy as 
the Delaunay elements, have come to be of great importance. We quote them 
here without verification: the first action variable is the sum 
J , i =* J r 4- Je -1- 
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The associated angle variable is the mean anomaly M, which is related to the 
eccentric anomaly u, occurring in the parametric representation of the ellipse 
x *=■■ a 008 u, y mm b sinu, by the relationship 

M = 5^“ ~ eslnu), 

where c is the eccentricity. The second action variable is 2 tc thnee the total 
angular momentum: J 2 = 2 nP. The associated position co-ordinate is given 
by 1/2tt times the angle between perihelion and the line of nodes, the latter being 
the line of intersection of the orbital plane and the ay-plane of a fixed 00-ordinate 
system. This oo-ordinate is oonstant for an unperturbed Kepler ellipse. The 
third momentum co-ordinate J t is 2rc times the z-oomponent of the angular 
momentum, i.e. J 3 = 2nP,. The associated position co-ordinate to, is l/2rc 
times the angle between the line of nodes and this z-axis; this also remains fixed 
for an unperturbed Kepler ellipse (fig. 1). 



CHAPTER VII 

Teb Mechanics of Rigid Bodies 
1. Selected Topics in the Sin ematics of Rigid Bodies. 

(a) The number of degrees of freedom of a rigid body 
A rigid body is understood to be a system of particles, the indi- 
viduals of which are rigidly inter-connected, i.e. always maintain the 
same distance from each other. We therefore leave out of consideration 
deformations which occur in all actual physical bodies. 

The position of a rigid body with respect to an arbitrary co-or- 
dinate system fixed in space is determined if the position of the origin, 
and the orientation, of a. co-ordinate system fixed in the body are 
given. In order to fix the origin of the latter co-ordinate system we 
need three co-ordinates; to specify the orientation of this system, we 
may use the direction cosines of the axes referred to the system fifed 
in space. If we denote these cosines by a t , y,-, then 


i' — a j* -f- & j -f~ y x k ] 

/“ « 2 * + &/+ y Je !>. . . . . . ( 1 ) 

k' = a 3 i -f fi 3 j -f yjk J 

Bnt there are six equations connecting these nine coefficients, viz.: 

1 = *"*=01*+ A 3 + yi 2 ) 

1 +&*+** \ (2) 

l — k' 2 = tt 3 2 + /? s 2 + y s 2 j 

0 = t'j‘ = a T a 2 -f + y x y 2 "j 

o =/# = agtig + && 4- y 2 y 3 k (3) 

0 = k'i' = a a a^ -j- && -f y sYl j 


i.e. there are but three additional free variables. The position of a 
rigid body is thus completely determined by six numbers (co-ordinates); 
the number of degrees of freedom is therefore six. 

It often proves useful to introduce three independent variables, 
in place of the nine direction cosines conditioned by six equations. 
Por this purpose the Etder angles 9, if, <js are suitable. These angles 

1S8 
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are defined as follows (cf. fig. 1): 9 is the angle between the z'- axis 
and the z-axis. The ry-plane cuts the a/y' -plane in a straight line, 
which we again call the line of nodes, and which we define by the unit 
vector n. We draw a straight line and corresponding unit vector m' 
perpendicular to n and in the x'«/'-plane. The angle xft is the longi- 



tude of the node, i.e. the angle between the line of nodes and the 
x-axis. <f> denotes the angle, in the x' 3 / -plane, between the x'-axia 
and the line of nodes. If the Euler angles are given, it is not difficult 
to draw the axes i', f, k' . Using ifi, one can first draw the line Gf 
nodes in the xy-plane. Through this line pass a plane making an angle 
9 with the xy-plane; this is the x'y'-plane. The normal to it is the 
z'-axis. In the x'y'-plane, the position of the x'-axis is then determined 
by <j>. 

The axes 0*', 0,/, 0 / are fixed in the body, and 0 a , 0 V , 0 Z are 
fixed in space. 

The number of degrees of freedom is diminished if separate points 
of the rigid body are held fixed. If, for example, one point is fixed, 
we place the origin of co-ordinates 0 at that point; the position of the 
body is then determined by the three Euler angles 9, \(i, <j>, and so the 
number of degrees of freedom is reduced to three. If we hold two 
points fixed, the only possible motion is rotation about the axis formed 
by these points. The position is then fully determined by giving the 
angle of rotation, i.e. the angle between a line in the body normal to 
the axis and a plane fixed in space, through that axis. A rigid body 
in which two points are fixed has then but one degree of freedom. 
This last degree of freedom is also removed if the position of another 
point, not in the axis, is prescribed. We shall consider the motions 
possible for a rigid body under these various limitations, and finally 
shall touch upon the general motion of a free rigid body. 
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(6) Rotation of a rigid body about a fixed axis 

The angular velocity and the direction of the axis together may 
be characterized by a vector in the following manner; we assign a 
vector to in the direction of the axis, such that the rotation is clock- 
wise when looking in the direction of to, and such that the magnitude 
of to is equal to the angular speed to. This representation has a moan- 
ing only if we can show that the angular velocities so specified follow 
the laws of vector analysis, i.e. that two angular velocities around 
intersecting axes combine to give a third angular velocity whose 
axis and magnitude are determined by vector addition. This is actually 
the case. If coj is the first angular 
velocity, the linear velocity v x of a 
point whose radius vector, referred to 
a point 0 on the axis, is r is given in 
direction and magnitude by the vector 
product for the linear speed (see 

fig. 2) is equal to the product of co x by 
the perpendicular distance p of the 
point P from the axis, i.e. equal to 
I r x | sin(co 1 »'). Moreover, t> 1 is nor- 
mal to the axis, which is also true of 
['Ojf']. If now a second axis through 0 
is given, then the linear velocity corre- 
veloeity o> a is given by v 9 ==* £»**']. 
The resulting linear velocity is thus 

v t mm far] + [u) 8 r], 

which, by the distributive property of the vector produot, is 

= | (coj + w 2 )r | = | tor | (4) 

That is, we may replace the angular velocities Oj and to, by the 
angular velocity to which is obtained from to x and t o t by vector addition. 
If we multiply both members of equation (4) by dt, we have the same 
law of combination for infinitesimal angular displacements (rotations). 

However, this important rule for combining angular velocities and 
infinitesimal rotations must not be understood to hold for the analo- 
gous case of finite rotations about two intersecting axes. True, one 
may assign to every finite rotation a directed line-segment whose 
length is proportional to the angle turned, but — as may be seen at 
once by simple examples — two such rotations combine in a much 
more complicated manner than two vectors. A finite rotation signi- 
fies a transformation of axes, i.e. a tensor; and while two successive 
transformations may be replaced by a single one, the replacement 
cannot be made as simply as for angular velooities. 
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(c) Plane motion of a rigid body 

Next to rotation about a fixed axis, the simplest case would be the 
motion of a rigid body about a fixed point, according to the above 
enumeration of degrees of freedom. However, it is advisable before 
taking up this case to consider a motion which also has three degrees 
of freedom — plane motion, in which all points of the body move 
parallel to a given fixed plane. All points on a line normal to this 
plane then describe congruent 
paths, and so it is sufficient to study ^ 

the motion of one plane moving / \ \. 

over another fixed plane. One may 

picture this motion as that of a \\ / /' , % 

sheet of paper moving about on a Is' ''''' 

plane table- top. It is evident that 

this motion has three degrees of p ° 

freedom, for if we fix one point of Fi «- * 

the moving plane by giving its two 

co-ordinates, we can still turn this plane about the fixed point; the 
position of the moving plane is completely fixed by further specifying 
the angle of turn referred to a fixed axis. It may be shown that the 
general motion at any instant may be looked upon as a rotation about 
an instantaneous pivot point, the instantaneous centre; for if we 
select two arbitrary points Ppand P 2 , the motion of every other point 
is determined by the motion of these. After a time A t the two points 
are at Pf and P a ' respectively (fig. 3). Draw the perpendicular bi- 
sectors of P 1 P 1 ' and PgP a '; let these intersect at P 0 . The triangles 
PJPyPi and iVYPj' are congruent, 
since the construction makes PJP X — ^ 

W and PoPjj^PoPj, and also 

PjPj — PfPf on account of the V 

rigidity of the body. As a result, ° \ , 

the triangle PJPfPf can be brought 

into superposition with triangle 

P^Pa by rotation about P ? . Since ° 

the position of the body is deter- herpaUwde \ 

mined, for plane motion, by the two Fig 4 ' 

points P x and P a , which were chosen 

at random, the same construction with any other pair of points must 
also lead to the point P 0 . If A* is allowed to approach zero, the seg- 
ments P X P X and PjPj' become the tangents of the paths, and have the 
directions of the velocity vectors, aiyl are normal to P X P 0 and P,P 0 
respectively. The instantaneous state of motion may therefore be looked 
upon as a rotation about P 0 . This point is the instantaneous centre. 

If the position of the instantaneous centre is marked in the fixed 



142 


MECHANICS 


plane, as well as in the moving plane, we obtain (except in the trivial 
case of simple rotation about a feed point) a curve in each plane. 
The curve in the feed plane is called the herpolhode’, that in the moving 
system, the polhode, According to their definition, both curves must 
have the instantaneous centre in common at any instant. An infinitesi- 
mal rotation about this point must bring the neighbouring points of 
both curves into coincidence. This is possible only if the two curves 
touch at the pole and roll upon each other without slipping. This is 
immediately evident from a diagram in which the curves are pro- 
visionally replaced by broken lines {fig. 4, p. 141 ). 

{d} Motion of » rigid body about a fixed point 

The motion of a rigid system about a feed point bears the same 
relation to plane motion as spherical geometry bears to plane geometry. 
Just as the points on a normal to the feed plane describe congruent 
orbits in the case of plane motion of a rigid body, so here the points 
on a radius through the feed centre describe paths which are similar 
spherical curves, similarly placed with respect to the fixed point. It 
is therefore sufficient to examine the motion on any spherical surface 
having the fixed point as centre We may thus picture the situation 
as the sliding of a spherical shell upon an inner feed sphere. We can 
apply the same construction as we used for plane motion, and so 
obtain an instantaneous centre on the sphero. If we trace the path 
of the instantaneous centre, we obtain two spherical curves, the her- 
polhode and the polhode. By rolling the latter curve upon the former, 
the motion is determined exactly as in plane motion. In reality, we 
are not dealing with two spherical curves, one rolling on the other, but 
with two cones, if we consider all the concentric spheres. In the same 
way, in the case of plane motion, we really have two cylinders, whose 
generators are normal to the plane, rolling one upon the other. 

(e) Arbitrary motion of a rigid body 

Since the position of a rigid .body is determined by the position 
of one point, and by the orientation of a set of axes feed in the body, 
a change to a neighbouring position can be effected by a translatory 
motion which brings that point to its new position, followed by an 
infinitesimal rotation of the axes to their new position. We can always 
arrange matters so that the axis of instantaneous rotation lies in the 
direction of the translational motion, and a rigid body may therefore 
always be moved to a neighbouring position by means of an infinitesi- 
mal screw motion. W T e give this result without proof, since no further 
use will be made of it. If we again trace these instantaneous screw 
axes in both the moving and feed systems, we obtain two ruled sur- 
faces. In the course of the motion, these surfaces not only roll upon 
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one another, in plane motion or spherical motion, but there h a 
simnltftTieons slipping along the instevtaneons common generator. 

S. General Statics and Dynamics of Rigid Bodies. Iguiratance of 
System* of forces acting upon Rigid Bodies, 

According to Chap. Y (p. 85), the necessary and sufficient con- 
dition for the equilibrium of a single particle (i.e, for the absence of 
ail accelerations) is the vanishing of the resultant of ell applied forces. 
‘In the general mechanics of systems of particles, the vanishing of the 
resultants of all force vectors means that there is no acceleration of 
the centre of gravity of the system: i.e. if the centre of gravity was 
at rest before the system of forces having zero resultant was applied, 
It 'remains at rest when the set of forces is applied. This still, docs not 
cay that the system of particles remains at rest, for rotational accelera- 
tions about the centre of gravity may still, take place. Fox a rigid 
body, the vanishing of the resultant moment 
M *» S [r 4 PJ 


is the necessary and sufficient condition for the absence of this kind 
of acceleration, also. The reason is that the rigid connexions of the 
separate parts ensure that the changes in angular momenta of the 
mart* are all of the same sign, so that it is impossible for these changes 
to compensate each other, as may occur in an arbitrary system of 
p&rtielea. As a result, when X ~ 0, equation (7) (p. 110) becomes 


dP d v , 

* dt = .! ; 


>«*<]• 


As we have just seen, the absence of a change, is. the total angular 
momentum P implies that there is no change in the angular momentum 
of the separate parts. If then P also vanishes, and no motion existed 
at :Srst s tke system remains at rest. The necessary and sufficient con- 
ditions for the equilibrium of a system of forces applied to a rigid body 
are therefore the vanishing of the resultant force and of the resultant 
moment of the forces: 


E F< = F — 0 and EfoFj *= M = 0. , , {5) 

The effect of a system of forces on a rigid body depends solely 
upon these two resultants; This is evident for statics, but in kinetics, 
too, only these two vectors occur, as we shall see in the following sec- 
tions. All systems of forces which yield the same resultants F and M 
ars entirely equivalent in affect. 

While the vector quantities with which we have dealt thus far 
could be moved about parallel to themselves at will, without changing 
their significance, this is no longer true of the forces acting upon a 
rigid body. It is evident that if we displace parallel to itself a force 
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vector acting at a point P { , we change the point of application in such 
manner that the moment of the force is altered. The moment is un- 
changed only if the displacement is along the vector. A force acting 
on a rigid body is a vector localized in a line and may be moved only 
along its line of action. 

The value of the resultant moment depends, in general, upon the 
position of the reference point O. There is an important special case 
where this is not so, viz. if the resultant F vanishes. If, for example, 
we shift the reference point to O' by a vector d, and denote the vectors 
radiating from O' by primes, then 

r" — r— d, i.e. 2[>/F ,] = S[r ( F ( ] — 2[c?F,] 

= Z[r { E ( ]-[dn . . ( 6 ) 

The expression S[fi?F<] vanishes, no matter what the direction of d, 
if £F< = 0, so that the centre of gravity remains at rest or moves 
uniformly. 

The scalar product F3S is independent of the point of reference in 
any case, for if we multiply both members of equation (6) scalarly by 
F, the term F[«?F] vanishes, and there remains 

FStr/Fj-FS^FJ (7) 

It may be shown also that there is always an aids such that if any point 
thereon be chosen as reference point, M will have the direction of F. This axis 
is called the central axis of the system of forces. If we denote the moment referred 
to O' by ST, the vector M' should be so determined that 

M'= M— [cfF]«= yF, (8) 

where y is a scalar which is yet undetermined. Scalar multiplication by F yields 
M'F = MF = yF*, 


and [OF] - M — Y F - M — SgP ~ (9 ) 

From (9) we see that 

+ (W) 

where X is an arbitrary variable scalar quantity. This means, however, that the 
terminus of d is on a straight line parallel to F. If, in particular, M is parallel 
to F, the central axis passes through O and is parallel to F. 

The simplest system of forces having a vanishing resultant F but 
a non-vanishing H consists of two equal and oppositely directed forces, 
F applied at the point r x and — F at r 2 . This system is called a couple. 
The resultant moment M == [(r x — r 2 ) F] iff a vector perpendicular 
to the plane of the two parallel forces and equal in magnitude to F 
times the perpendicular distance between the forces. Since we saw 
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that the effect of a set of forces applied to a rigid body depends only 
upon F and M, we see that any system of forces may be replaced by 
a single force F and a couple M whose moment is equal to the resultant 
moment of the set of forces. 

If all the forces acting on a rigid body are copianar — a frequent occurrence 
in practice — then the magnitude and line of action of their resultant may bs 
found in a vivid way by means of two graphical constructions known as the 
polygon gf forces and the funicular polygon. In fig. 5a the four applied forces f\, 
Fj, F*, F 4 are shown each in its correct line of action. In fig. Sb the resultant F 
is determined by the familiar force polygon construction. In order to find the 
correct line of action of the resultant, we select au arbiter, ry point O and draw 
lines from it to the various comers of the polygon. These lines may be thought of 
as auxiliary force vectors F 0 ,i, Fj,*, Fa s, F 3/4 and F- i0 . From the triangles thus 
formed it appears that each pair of auxiliary forces is in equilibrium with one of 
the given forces, provided that we apply both of the former at a single point on 



Fig. 5 


the line of action of the given force. Except for Fo,i and F 4|0 , however, afi 
auxiliary farces enter twioe, each time with opposite signs ; and sinoe the two 
members of a pair act in the same line they cannot change anything connected 
with the equilibrium conditions. Inasmuch as equilibrium obtains after the 
introduction of the auxiliary forces, F 0 ,i and F 4 ,o must be in equilibrium with 
the resultant F, and the line of action of the latter must pass through the inter- 
section of those of the former two forces. Hence in diagram (a), through any 
point of Fj we draw a line parallel to F 0 ,i and another parallel to Fi, a . Through 
the intersection of the latter with F 2 we draw another line parallel to Fa, s , and 
so on. The intersection of F 0 , i and F 4 , 0 is a point on the resultant, whose direc- 
tion is that of the closing side F of the polygon in (6). 

This sequence of lines supplied to the diagram of original forces is called the 
funicular (Latin: pertaining to a rope) polygon because a rope would assume this 
shape under the action of the given forces. In the event that the foroe polygon 
happens to be self-closing, the directions of Fo, • and F 4 , 0 will be parallel and the 
resultant will be a couple. If the two parallel linos are coincident, in which case 
the funicular polygon will be closed also, the given forces will be in equilibrium. 

Ex. 47. A beam is supported at its ends and carries a concentrated load of 
P kg. at a point $ the distance from one end. By reversing the construction just 
described, find the magnitude of the downward forces on the supports. How 
does one proceed when several loads are applied? 
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3. Solution of a Bigid Body about ». Fixed Axis, Moment of Inertia 
sind its Calculation. 

'We begin the subject of the dynamics of a rigid body with the 
simplest case — “ one-dimensional ” motion about a fixed axis. This 
motion is determined by a single co-ordinate, the angle of tarn 
Thinking of the axis as a material axh, we may consider it fixed by 
.bolding two of its points, 0 and O', in bearings. Let the resultant of 
the external forces be. F, and the resultant moment, M If we apply 
the general equations of the mechanics of systems of particles, thus 
treating the body as free, we must also take into account the reactions 
at the bearings. That is to say, if the bearings were not present, the 
axis of rotation would, in general, tend to move in some direction. 
This is prevented by the bearings, and so the axle exerts a pressure 
on the bearings. We may consider the bearings replaced by forces 
which are equal to and opposite to the pressure exerted by the axle 
at any instant. Cali these forces, acting at 0 and O', V r and IP/ 
respectively. If we put the origin of co-ordinates at 0, the point of 
application of one of these forces, then ff, vanishes, and the equation 
for the rate of change of tiro total angular momentum of tbs system 
becomes 

f = W] + M (11) 


W© now put the 2 axis of our co -ordinate system along the axis of 
rotation. Since the vector product [y'F/J has no 2 -component, on 
account of r' — r'k, the reaction force for the ^-component of equation 
(11) drops out, and we obtain the simpler equation 


dF t 

dt 




(12) 


which we will now treat further. If the angular velocity vector is 
o> - <xk, (13) 


then 

P = — Smjr i [cn?' i ]] 

= or >1 rn ( vf — 2?;vr,-(«nr,-) 

= <ah'Lm i r? — <ii'Lrft, i r i {r i k), . . . (14) 

and 

P z — u}T,m i r i i — a>Em i {y i kf‘ = (i>'Em i [r i fe] z 

= cuSm/sj 8 + j/i 2 ). 


(15) 
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The quantity 

= Sr^fa;/ + yf) —■ 'Zm, i R i l . . (16) 


is called the moment of inertia of the body about the 2 -axis, and 
is denoted by J se , for a reason explained below. With this notation, 
(12) gives the equation of motion 




(17) 


Since the position of the rotating body is determined by the angle <f>, 
no other equations are needed if wo wish to study only the course of 
the motion. We must bring in the remaining equations of motion, 
however, if we wish to find the pressures at the bearings. Comparing 
(17) with the statement of Newton’s Second Law for the case of one 
degree of freedom of a particle, viz. 


dhc 
m df 




W8 see that the two equations are mathematically identical, and that 
the following paired quantities correspond to each other: 


Moment of inertia I„. 

a i , .. d s <j> 

Angular acceleration --gj. 

Component, along the axis of 
rotation, of the moment of 
the external forces. 


Mass in. 

Linear acceleration 

At 1 

Component of force in the 
direction of motion. 


Moreover, the kinetic energy of rotation becomes, in complete analogy 
with the ease of linear motion, 

<“> 

as may be verified readily. 

In the mechanics of a particle, the vibrational motion taking place 
under the influence of a quasi -elastic force ■— — Tex is of importance. 
Is. the rotational motion of rigid bodies, also, there often arise re- 
storing torques which are proportional to the angle of turn, and whose 
representative vectors are in the direction of the axis. Such a turning 
moment is of the form M = — r<f>, and the equation of motion becomes 

I^+rt-0, (19) 


where in place of l ec we write I for brevity, it being understood that 
we mean the moment of inertia about the axis of rotation. The factor 
of proportionality -r is called the moment of torsion-, it has the dimensions 
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of force X distance, i,e. work. A torque of this kind arises, for ex- 
ample, when a rigid body, suspended by a wire clamped at the upper 
end, is slightly twisted about the wire as axis. Since equation (19) is 
iddbtical with equation (24) (p. 93), apart from notation, we can 
write down at once the frequency of free vibration of a rigid body 
under the influence of a restoring force proportional to the angle of twist: 

_ 1_ jr _1_ / moment of torsion 
V 2 try I 2tt y moment of inertia ’ ‘ ' 

In order to determine 7, we must calculate the quantity 
where h t is the perpendicular distance of the ith particle from the 
axis. For a continuous distribution of matter of density p, the sum is 
to be replaced by the integral { J j phtdxdydz. We shall find, below, that 
the moment Of inertia about any axis may be- found without another 
integration, provided that this quantity is known for a particular set 
of three mutually perpendicular axes of the body intersecting at the 
centre of gravity. We show in the first place that the moment of 
inertia about any axis not passing through the centre of gravity may 
be calculated from the value of the moment of inertia about a parallel 
axis through the centre of gravity. If r is the vector to the centre of 
gravity from an origin on any selected axis, and if we denote by primes 
the radios vectors from the centre of gravity to the various particles, 
we have 

r ( =r+r /. . (21) 

If the axial direction is denoted by the unit vector c, 

I = Xm i [r { cY='Lm i [rc] t + I.m i [r i ’cY, . (22) 
since = 0 (of. p. 107), i.e. the term 

2 Ew„[rc] . [r/c] 2 [rc] . [Sm t r/, c ] 

vanishes. 

The second term of (22) represents the moment of inertia referred 
to the axis passing through the centre of gravity; the first term re- 
presents the moment of inertia of the body about the new axis, con- 
sidering the entire mass concentrated at the centre of gravity. This is 
Steiner's Theorem: 

The moment of inertia of a rigid body referred to an arbitrary axis is 
equal to the moment of inertia referred to a parallel axis through the centre 
of gravity, plus the moment of inertia of the entire mass about the given 
axis, assuming it to be concentrated at the centre of gravity .* 

* It is sometimes convenient to introduce the so-called radius of gyration k defined by 
Le. t is the distance from the axis at which the entire mass may be considered oon- 



VII.] 


THE MECHANICS OF RIGID BODIES 


149 


We now calculate the moment of inertia about an arbitrary axis 
through the reference point 0. If the direction cosineE of the axis, 
referred to an arbitrary fixed system of co-ordinates, are cos a, cos /?, 
cosy, then 

I — 2 TO.foc]* = 2m ( { (y { cos y — e ( cos jS) 2 -f (z { cos a — x t cos y) 2 
+ (Xf cos £ — y, cos a) 2 } 

— Sm^y, 2 -)- z, 2 ) cos 2 a -}- 2 m^z,- 2 -f- $ 4 2 ) cos 2 )3+ 2 -{- y 4 a ) cos 2 y 

- 22«i 4 y 4 z 4 cosy cos — 22w» 4 z 4 cc 4 cosa cosy 

— 2'Zm t x i y i cosa cos /? „ (23) 

The moment of inertia is thus a quadratic function of the direction 
cosines of the axis. The coefficients of this quadratic function are of 
two kinds; those of the squares of the direction cosines are the moments 
of inertia about the three co-ordinate axes: 

£m<<y ( 2 + *<*) *= hr ) 

'LmW + x?) = I vv }- (24) 

4- y?) = h, i 

The coefficients associated with the mixed products of the direction 
cosines are called the 'products of inertia: 

ImiXiyt = I X1) I 

2m 4 y 4 z 4 — ht [ (25) 

Urn^ZfXf = I ta j. 

The moments of inertia, as functions of the directions of the axes, 
can be represented graphically by means of a surface, as follows. 
Draw the radius vector R from the origin 0 in the direction of the 
axis about which the moment of inertia is taken. Take the length of 
this radius vector so that 



Referring the resulting surface to a system of co-ordinates with origin 
at 0, we have 

cos a = ^ = y/l x, cos/J=v7y, cos y^^lz, . (27) 

Then, according to (23), the equation of the surface which is the locus 
of the end points of R is 

I xx x* + 7 vv y 2 -f- J„z* - 2 I xv xy - 2/„yz - 2 I te zx = 1. (28) 

eentrated without altering the moment of inertia. The theorem may then be stated: 
“ The square of the radius of gyration about any axis is equal to the square of the 
radius of gyration about a parallel axis through the centre of gravity plus the square 
of the distance between the two axes.” The other conclusions of this section also 
may be expressed in terms of the radius of gyration. 
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This equation, being of the second degree, represents a eonieoid, or 
quadric surface, It is an ellipsoid (in special cases, an ellipsoid of 
rotation, or a. sphere), since the moment of inertia of a physical body, 
referred to any aria, cannot vanish, and hence no radius vector 
R ljs/l can become infinite. Since the quantities l xx , &c., depend on 
the choice of reference point, there will be a different ellipsoid of 
inertia for every point of reference 0. Now, according to analytical 
geometry, for every surface of the second degree there is a particular 
system of co-ordinates, that of the principal axes, in which the equation 
of the. surface has the simple form 

A®' 2 + /.Jr'* + V'* = 1. .... (29) 

The three quantities T v J„ I, are called the •principal moments of in- 
ertia. If the axes of the principal moments and the magnitudes are 
known, the moment of inertia for any other axis through 0 may be 
evaluated graphically or analytically as the inverse square of the 
radius vector drawn in the direction of the axis in question. If 0 is 
the centre of gravity, the moment of inertia for any other axis may be 
found by applying Steiner’s theorem. A second-degree surface has 
six independent, coefficients. In general, then, one must measure six 
moments of inertia, in so far as the positions of the principal axes of 
inertia (which correspond to the axes of symmetry) cannot be deter- 
mined in advance on grounds of symmetry. The six quantities deter- 
mining the ellipsoid of inertia are to be interpreted as follows— -three 
for the principal moments of inertia and three additional ones for the 
orientation of the system of principal axes within the body. 

The expression for the kinetic energy of a rotating rigid body is 
specially simple in the co-ordinate system of the principal axes of 
inertia. In general we have 

T — —■ JEmjjoif,]*. 

and since 

«*> = (DC, 

this expression becomes 

T = -- Wl. 

But we have already calculated the expression for 'Zm i \cr\- in the 
co-ordinate system of the principal axes. Since l K * — 0 for k 4= l, we 
have from equation (23),* 

T — cos 2 a -(- 1 2 cos 2 /? -j- / 3 cos 2 y), . . (30) 



and since 


(31) 
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resolution of to along the principal axes of inertia fixed in the body 
makes the expression for the kinetic energy take the form 

T = + / 2 ov 2 + Z 8 «v s ) (32) 

Ex. 48, Calculate the moment of inertia of a solid, homogeneous oifbe about 
one )f ob diagonals. 

Eh, 49. Theory of Kater'a Pendulum , If a rigid pendulum has the same 
period when oscillating about two parallel axes at unequal distances e and «' 
from the centre of gravity, then the length of the simple pendulum having the 
same period is given by s + s'. Prove this. 

Ex. CO. What must be the distance of the point of suspension of a pendulum 
from its centre of gravity in order that a small change ia this diataneo may have 
a minimum effect on the period T 

Ex. 51. For many purposes, a diatomic molecule may be represented by a 
rigid “dumb-bell ” model, in which the masses Af, and are a, fixed distance 
a apoare. This system rotates as it flies through space. Represent the kinetic 
energy as the sum of translational and rotational energy. 

4. Motion of a Rigid Body about a Fixed Point. Elements of the Theory 
of the Top. 

(a) Relation between angular momentum and angular velocity 

For the investigation of the motion of a rigid, body about a Used 
axis we needed only the component of total angular momentum in the 
direction of the axis, and not the forces holding the axis in place, so 
long as wo were interested only in the progress of the motion, and not 
in the forces at the bearings in which the imaginary axle was con- 
sidered to turn. If, now, one of the bearings is removed, and the axle 
is constrained only at one point 0 , the axis will not, ia general, re- 
main stationary, but will turn about 0. If we wish to apply the theorem 
concerning the total moment of momentum, we must consider again 
the relation between the angular momentum vector P and the angular 
velocity vector <o, given (p. 146) by 

P = S m t r?(ia) — ‘Lrn i r i (r i o»). 

Writing the equation in components, 

P« — 2 »*,(*<* + y? + «vV* ~ -'r t/ s w„ -f- z t (u t ), 

OS P* — I 0a Cife — l av co v — 1 

and similarly, P v — —I^a> x + I vv oo v — l vz o j s i , ... (33) 

P z = -l xz a> s - l vz co y + J 

That is, the components of F are linear, homogeneous functions of the 
components of to. The matrix of coefficients is symmetric, since the 
terms symmetric with respect to the diagonal are equal. Starting with 
a given vector co, the correspondence between the vectors P and to, 
determined by six coefficients, must, of course, always lead to the same 
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vector P, in whatever way the vectors are resolved into components. 
But, according to p. 36, this is the criterion that the six coefficients 
should be the components of a symmetric tensor — the inertial tensor. 
According to the explanation on pp. 36 et seq., one can find the vector 
P corresponding to a given vector to by using the tensor ellipsoid: 

■fast* 2 + — 2I xv xy — 2I vt yz — 21 ^zx = 1. (34) 

This ellipsoid is identical with the ellipsoid of inertia discussed in the 
preceding section. To find P, draw a radius vector r parallel to to. 
Then P has the direction of the normal to the ellipsoid at the terminus 
of r. The magnitude of P is determined as on p. 37 by the equation 
upon which the construction of the ellipsoid is based. The normal 
and the radius vector have, in general, different directions, so that 
the vector of the resultant angular momentum does not, in general, 
have the same direction as the axis of rotation. The two are parallel 
only for the directions of the three principal axes of inertia. If we denote 
by accents the components referred to this co-ordinate system, which is 
determined by the form of the body and the distribution of mass, then 

P* —■ fi«v i 

P V ’ — ItfOyl | . ...... (35) 

P,- = J a “V ' 

This co-ordinate system — in which the axes of co-ordinates are the 
axes of the ellipsoid of inertia — is fixed in the body, and thus changes 
direction when the body turns. On the other hand, all the theorems 
already proved, such as the law of conservation of angular momentum 
in the absence of external torques, are based on a system of co-or- 
dinates fixed in Space. Fot example, if in this case of P fixed in space, 
we imagine the vector P to be watched by an observer on the rotating 
body, he will observe an apparent motion of this vector. 

In mechanics, any rapidly rotating rigid body is called a top. The 
easiest case to treat is that of a top having three equal moments of 
inertia. It is called a spherical top. Naturally, the body itself need 
not be a sphere; it is only necessary that the mass be so distributed 
as to make the ellipsoid of inertia a sphere. By (35), P and o> will 
have the same direction for a spherical top. For other reasons, the 
tops or gyroscopes encountered in practice are rotationally symmetric 
bodies having two of their principal moments of inertia equal, with 
the consequence that the ellipsoid of inertia is an ellipsoid of rotation. 
Tops having three different moments are very difficult to handle 
mathematically and almost never occur in engineering practice. They 
have their place, nevertheless, as models for uasymmetric polyatomic 
molecules. We shall restrict our attention to the symmetric top. 

Let us choose the A'-axis as the axis of symmetry of the rotor, 
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also called the axis of figure. This will also be the axis of figure of the 
inertial ellipsoid. Call I the moment of inertia about the figure axis, 
and J that about an axis normal to the figure axis. Call the ratio 
IjJ = 8 . The numerical value of 6 must be between 0 and 2, as shown 
by tlje following reasoning: on account of the rotational symmetry, 


= y', and we have 

I —j (*'* + y' z ) dm = 2 J x'Hm, (36) 

J = I/O =/V 2 + z' 2 )dm = £7 +fz'*dm. . . (37) 


Since 1/2 occurs in (37) with a positive quantity added to it, we must 
have 1/2 < 1/0 < oo , or 2 > 5 > 0. 

For a symmetrical top, h' , the moment of momentum P, and the 
instantaneous angular velocity vector to all lie in one plane; for the 
construction of P by means of to, using the inertial ellipsoid, is carried 
out in a meridian plane. The same result follows analytically pro- 
vided that to can be written as the sum of a P component and a k' 
component. First, if we write to and P in component form, referred to 
the set of axes fixed in the body, and if we combine the components 
of to that are normal to k' to form a vector to„', we obtain 

to = tOtft' -f- j' -J- — to n - -j- , . (38) 

F — ^ (to*- i' + aty j ) + = -q to„< -J- Icojk'. (39) 

Eliminating to B -, 

to = j P — (0 — 1 )«vA>' (40) 

Fig. 6 represents graphically the content of equations (38) to (40). 

(6) Symmetrical top subject to no forces 
Consider now the behaviour of a symmetrical top on which no force 
moment acts. In this instance the vector P will be constant, and so 
differentiation of (40) gives 

But according to p. 140 the rate of change dr/dt of a radius vector r 
due to rotation is given by [t or] ; and since any vector, including the 
axial unit vector k\ may be represented as the difference of two radius 
vetotors, we have 
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Hence ~ (l - 9) 6' -f (1 ~ fakT), . . (42) 

It follows from the last relation that is constant; for if we farm 
the scalar product of this equation with ti . it follows from (41) that 
with fo'ta — &.y, that dco^jdt = 0. However, not only «y hut also the 
magnitude <o of to is constant; for if we form the scalar product of (42) 
with to, and use the fact that ay « const, because — 0, we get 



But since ay = tak\ the angle between to and h' must slao be con- 
stant. According to fig. 6, which is valid at any instant, the angle 



Fig. 6 Fig.? 


formed by P and h' is also constant, and the only possibility of a 
change in the vectors is for the piano of the figure to rotate. The axis 
for such a rotation must fee the invariable line ,P. The general motion 
of a force-free symmetrical top is called nutation. In it, the instan- 
taneous axi3 «o describes a cone about the axis ,P, which is fixed in 
space, while the top itself rotates about so. On the other hand, any 
motion of a rigid body about a fixed point may be represented accord- 
ing to § 1 (e) by the rolling of the poihode on the herpolhoda, the in- 
stantaneous angular velocity being that of the generator common to 
both cones. Thus in the present instance the herpolhode is the ocae 
described about the axis P by so, while the poihode is described by the 
motion of to around k' (fig. 7). Both are right circular cones. Since 
an actual gyroscope always has high speed about the figure axis, and 
since the nutation arises from disturbances, as will be explained below, 
the angles between k' , P and to axe small because P has very nearly 
the same direction as k' . However, the angular speed of rotation of 
h' around the fixed axis P is comparable with «y, and so ; s large. This 
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follows from the fact that the component in question is, by (40), 
(0/1) F, Taking as an approximation | P | — /&>„-, the magnitude of 
this component becomes 0o>,<. In a practical case, 0 dees not differ 
much from unity. 

Example-. The wandering of the earth's pole. The, earth is a top whose angular 
speed is small hut whose moment of momentum is large, due to its tremendous 
rotation inertia. The pciar flattening makes 6 = 301/300. Displacements of 
matter cause slight irregular wandering of the position of the pole over ti e earth’s 
surface -within a distance of about four metres! According to the preceding de- 
velopment, the angular speed around P will amount to 2ir(30i)/300 day 1 . This .is 
the speed with which an observer fixed in space would see the axis of the earth 
rotating about P. Referring to a terrestrial observer, we must subtract the 
normal daily rotation rate of 2ir day 1 , leaving 2;:/'30G day 1 , or a period of 800 
days. The observed period is 420 days, the departure being due to the deforma- 
tion of the rotating earth. 

Let ns examine one more special case of the motion of a force-free 
top. Suppose to to be constant in direction as well as in .magnitude. 
In tjtfs case equation (42) can be satisfied only if to and h 1 are parallel, 
in which case (40) shows that P must be parallel to h l also. Thus, if 
the axis of rotation coincides with the axis of figure, the top can con- 
tinue to rotate about this axis indefinitely. This is the origin of the 
term “ moments of deviation ” used in German works to denote the 
products of inertia; for any axis other than the figure axis these 
mixed terms do not vanish, and rotation about such an aids ie accom- 
panied by “ wandering ” of the axis of figure. 


(e) i Symmetrical top under the influence of external moments 
Imagine the top K to be contained in a housing G, fig. 8, which 
itself is free to move in all directions — for instance, by being allowed 
to float in a liquid. The centre of gravity of the top itself is assumed 
to coincide with the centre of the indicated sphere, Let ins axis of the 
top be again the k' axis. First of all, the top must be set into rotation 
by a force moment having the direction of k’. Once the desired rate 
of rotation is attained, this moment may be gradually decreased. Then, 
if friction is negligible, the top will continue to run indefinitely at this 
speed. Thus it is assumed that any moments thereafter applied to the 
top shall not have k' components. Under such conditions «v will be 
constant whether or not external moments exist. To see this, set the 
value of P from (40) into the fundamental equation (7), p. 310, obtaining 


dP 

at 


Scalar multiplication by k' yields 


M t = 0 = 






(*_!) * or = . (45) 
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dk ‘ A , dk! A 

since k = 0 and co = 0. 

We assume now that the housing rotates with angular velocity £2, 
and inquire as to the moment needed tc produce this rotation. Since 
co is referred to a fixed co-ordinate system rather than to one connected 
with the housing (and hence rotating), one would assume off-hand that 
co would be the vector sum of Si and <o t k' . However, sinoe the absence 



of friction in the bearings prevents the k' component of S2 from 
affecting the top, we must put 


co = £2 -f- (av — £2*-) k' — -J- ta^k', . . (46) 

Where £2*' denotes the component of £2 that is normal to k‘ . Only by 
assuming a relation of this form can we insure that k'ta = «o,>. 

Now, let the total change in angular momentum be resolved into 
two components, one of which represents a uniform rotation of the 
housing. Remembering (46) and substituting into (44) the expression 
for co given by (46), we get 


M 


IdSi„- , 
6-W + 


I 


dk' 1 „ ... dk! 


IdSl n -, T dk' 

enr+^'it- 


m 


^=[£2£']=[£2 n <£'3 


On account of 
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we obtain these important forms for the fundamental equation 


I d£i n ‘ 

eir 


dV 

at 


i aa n . 
e at 


-f J&v {ft*'#]. 


(«)• 


For' any practical gyroscope, Ic«v represents the main part of the 
angular momentum; let this quantity be denored by P. 

Consider the following special cases: 


(1) Uniform rotation of the homing about a fixed axis. 

This type of motion of the housing, and hence of the axis of the 
top, is much more important than the nutation described above. It 
is called precession, and is the most obvious of all gyrosoopic pheno- 
mena. Assume a co-ordinate system j", k" connected with the 
housing, and let the £"-axia coincide with the axis k' of the rotor 
(fig. 8). Since the component of £2 parallel to k' does not contribute to 
the vector product [Q&], equations (48) become for this case 

M = P~==P[£A'] (48') 

The change dk' \dt in the unit vector k' must be normal to k' . Further, 
k' rotates as time passes, so that the applied moment M must have 
this same rotation. This will be true, in the case of a force having a 
constant direction in space, if its line of action does not pass exactly 
through the centre of gravity of the top. For example, as in fig. 8, let 
a mass m be suspended from the housing at one end of they "-axis, the 
axiB of the rotor being in an inclined position. Denoting the unit vector 
of gravity by g 0 and writing 8 for the angle between g 0 and j" (which 
is the same as the angle of inclination of the rotor axis), we have 

M = mga [j"g^ — —mga(Bm8)i" ; . . . (49) 

i.e. the moment always remains perpendicular to k" (or to h'). 

For a given moment, equation (48') in itself yields only the magni- 
tude of the component of the angular speed that is normal to h' . The 
direction of the. axis about which the housing continuously turns is 
obtained as follows: 

Since dk'jdt lies in the direction of M, then according to (48') and 
(49), dk'ldt must be normal to g 0 ; hence (dk'jdt) g Q = 6. Integration 
yields 

k' go + cos(w/2 — 8) = const. .... (50) 

Thus tihe axis of the rotor must tiym in such a way as to maintain a 
constant angle with the direction of gravity, i.e. the axis must de- 
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scribe a cone about the vertical. The magnitude of. the precessional 
angular velocity is, by (48') and (49), 


Q. n _ M/P mga tan 8 

cos 8 cos 8 p 


(51) 


A gyroscope, then, dees not yield to gravity as does a non-rotating 
body — it “ turns aside at a right angle While section (6) showed 
that the aperture of the <xme of notation remains small for small dis- 
turbances, the aperture of the cone of precession is determined a priori. 
Even when the applied moment is small, an easily perceived change in 
direction of the axis soon results. It is only the speed of precession 
that becomes small as the moment is reduced. 


(2) Notation of a force-free top produced by impulses. 

Let a sudden moment M be applied at time t = 0 to an otherwise 
force-free top rotating about its axis of figure, and let this moment 
continue to act for At seconds. Denoting the time interval of the 
moment by N, integration of equation (48) with respect to the time 
gives 

N = f* TAdl = ~ O n . + lo> z . C ~ dt l Q, n . + Io> z .Ak'. (62) 

Jq u Jq dt u 

For any impulse, the change in velocity in a short time At is considerable, 
while the change in position is small to a higher order. For this reason, 
Ak' may be neglected. Hence, after impact, a new component of rota- 
tion normal to the figure axis and of amount £l n . — N djl is present. 
This gives rise to the case of nutation of a force-free top discussed in 
(1), the quantity £2 a - here corresponding to io n - there. Apart from 
external impacts, internal rearrangements of matter can also act to 
deflect the axis of P from the c*> direction [of. (5) above]. 

Precession and nutation may exist at the same time, as when a 
processing top is subjected to external impulses, or when shifting of 
part of the mass takes place. An impulse also must be considered to 
operate at the instant when the moment responsible for precession 
is suddenly applied, for a short-lived force may be represented by 
sudden application of a steady force, followed immediately by the 
application of the reaction force. Since the cone of nutation always 
has a small aperture, the nutation has only the effect of producing a 
corrugation of the surface of the cone of precession. This is usually 
negligible in technical applications. 

(3) Example of the precession of the axis of the earth. 

As a result of the polar flattening of the earth and the inclination of its axis 
(“ obliquity of the ecUptio ”) the attraction of the sun gives rise to a turning 
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moment when the earth is near either end of its orbit (fig. 9a). This moment, 
which seeks to bring the equator eloper to the plane of the orbit, vanishes at the 
equinoctial positions. However, since the angular momentum of the earth is so 
large, we know that the preoession will be very slow and an average moment may 
be assumed in computing the actual preoession. To be added to this is the average 
moment due to the moon’s attraction. Because of the smaller distance, the latter 
moment is of the same order of magnitude as that of the sun. The averaging is 
best done by assuming the masses of sun and moon distributed uniformly along 
the orbits they appear to have as seen from the' earth. For this purpose it is 
sufficient to assume ciroular orbits and to take their planes to be coincident. 
As seen from fig. 96, the gravitational moment seeks to bring the earth’s axis 
in co coincidence with the normal to the orbit plane, so that the vector represent- 



ing this moment must be perpendicular to the plane containing the earth’s axis 
and the normal. Hence this vector turns in the orbit plane with the processional 
motion, just as we saw above that the gravity moment vector acting on the gyro- 
scope turned in the horizontal plane. As in that instance, the earth’s axis de- 
scribes a cone about the normal to the orbit plane. The numerical value of the 
period of precession turns out to be about 27,000 years. 

Due to the fluctuating value of the moment, there is a resulting astronomioal 
nutation. This has no connexion with the wandering of the pole, whioh was 
explained above in terms of internal shifting of terrestrial matter. 

(4) Example of the gyrocompass. 

Suppose a gyroscope to be mounted in a frame capable of turning only about 
a vertical axis a, fig. 10. If the earth did not rotate, the gyroscope would maintain 
the direhtion of its axis of rotation, for then it would not be subject to any force 
moment. On account of the earth’s rotation around k with angular speed e>, the 
mounting — assumed for the present to be fixed to the earth— will be turned; 
and according to the third law, the rotor will exert a moment BT on the mounting. 
This moment is opposite to that which the turning of the axis would produce, 
and so by (48') it will amount to 

M' = — - Pco [kk'] (63) 

For an arbitrary position of the axis in a horizontal plane, this moment will have 
a vertical component SI/ as well as a horizontal one, M/. The former, according to 
§ 3, produces rotation about a, while the latter will simply be borne by the bear- 
ing L, beoause rotation about a horizontal axis is not possible. However, if the 
axis of the top lies in the meridian plane determined by the vertical and the 
earth’s axis (position k' of the latter), then vector M/ will be horizontal. Since 
there is no vertical component, the frame will not turn about a. In other words, 
the north-south position is one of equilibrium. The stable one of these two direo- 
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tktns is that having the smaller angle between ft and ft'. Because of the smallness 
of to, the acting moment is so small that it is able to overcome friction in the 
bearings only with great difficulty. The actual set-up resembles fig. 8. To com- 
pensate the moment exerted by the rotor on the housing, one need only attach to 
the latter a weight in at a point in the plane normal to the rotor axis. In order 



that this weight may exert a moment, the rotor axis must not be exactly horizontal, 
but must make a small angle 8 with this direction. Then, using the notation of 
fig. 10, 

mgaU"g^ + M' = - P^ikh'] = 0. . ... (64) 

Since a product vector is normal to the plane of its two factors, equality of the 
products signifies that the vectors k, ft', j" are ooplanar, Le. the rotor axis 

must again lie in the meridian plane when equilibrium is attained. 

Ex. 52. Ia the arrangement just described, compute the inclination of the 
axis of the top as a function of the geographical latitude <p and the mass m of the 
counterweight. Remember that for any given location the altitude of the pole 
is equal to the latitude. 

Ex. 63. A projectile fired from a rifled gun is given a spin about its axis; 
the projection of the trajectory upon a horizontal plane deviates from a straight 
line. How may this he explained qualitatively, and what is the direction of 
deviation for a shell having right-handed spin, i.e. clockwise rotation when look- 
ing along the tangent to the orbit? Air resistance tends to raise the nose of the. 
projectile. 



CHAPTER VIII 


Elasticity: The Mechanics of Deformable Solids 

1. The Geometry of Small Displacements. 

The ideal rigid body discussed in the previous chapter represents 
an abstraction, as is well known. Actually, the parts of a physical 
solid body, e.g. a bar of iron, are not absolutely rigidly interconnected, 
but suffer small displacements relative to each other under the in- 
fluence of forces. These deformations are, in general, so small that they 
can be negleoted in a large number of cases. On the other hand, the 
phenomena governed by elastic properties, e.g. the vibrations of a 
spring, are often of great importance, so that elastic processes offer 
a wide field for discussion in mathematical physics, a field which has 
been extensively cultivated, for practical reasons. We can take up 
here only a few of the chief problems and their solutions. 

In the light of our present concepts regarding the structure of 
matter, we can understand, in many cases, that considerable forces 
are necessary to deform a solid body. In the absence of external forces, 
the solid is in a state of internal equilibrium, in which its fundamental 
constituents {atoms or molecules) are at such distances apart that the 
forces between them are in equilibrium. Every change of these dis- 
tances, caused by external forces, immediately causes the attractive 
or repulsive forces to preponderate; their net result, which is either 
an internal pressure or tension, balances the external forces. We may 
construct a crude model by imagining the elementary- particles to be 
connected by spiral springs. This model shows us how a local de- 
formation must be transmitted to all parts of the body. The mathe- 
matical computation of the deformation resulting from given forces 
is a problem in Elastostatics, while Elastodynamics deals with the 
changes of elastic disturbances in time. For example, if, in tbo above 
model, we displace several “ atoms ” from their rest positions and 
then release them, they will move beyond those points in returning; 
and this vibration will spread throughout the whole body, on account 
of the connexions between the atoms. However, for purposes of cal- 
culation we shall not make use of the atomic picture, but shall treat 
the phenomena from a macroscopic standpoint, assuming that the space 
is continuously filled with matter. This means that the portions of 
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.space considered, and also their volume increments, are so large that 
they obtain a great number of atoms. 

We now wish to investigate the geometric-kinematic relationships 
for infinitely small deformations or strains. For this purpose we con- 
sider a region around a point P 0 , whose dimensions are so small that 
the squares of the distances of all its points P from P 0 may be neglected. 
On account of the strain, let each point experience a displacement 
S = ui -f vj -+■ wk, which we likewise consider to be small. More- 
over, let the displacement vary so little from place to place that we 
are justified in dropping products of the derivatives dujdx, du/dy, 
du/dz, dv/dx. ... If the point P 0 experiences a displacement s 0 , then 
another point P, whose position relative to P 0 is given by the radius 
vector r drawn from P 0 , undergoes a displacement 

s = « 0 -fP'grad-«. ...... (1) 


If we denote the components of s by u, v, w, the above equation may 
be written 

“• + (£)'•+ {£),»+ (£).* 

- ”• + *• g " d — ++ (£)." + ^).f + (I j).* 
rg “ d " = ”•+ («).*+ GjV+Cii). 

It is intended that the symbol 


/eUA 

\ dx )o 


should express the fact that, on account of the general hypothesis con- 
cerning the order of magnitude of the displacement components and 
of the volume under consideration, we may use the values of the deri- 
vatives at P 0 in carrying out the computation. Naturally, we rule 
out discontinuities, such as fractures or cracks in the body. 

Our equations give a linear relationship between the components 
of s and those of r. More important, however, is a related quantity: 
the connexion between an initial vector r = P$P and the correspond- 
ing vector / ss* P 0 'P' after displacement. This connexion may 
be looked upon as a mathematical transformation of the space 
* See pp. 2, 3. 
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about P 0 , determined by the deformation. From fig. 1 it follows t bat 
r' — r+ {s~ s 0 )~ r+ rts&i’s or r 1 — r — r grad • s, (3) 
or, in components,* 



Thus this transformation, ai&o, is linear. Since the conversion of r 
to r' is independent of the co-ordinara 
system we select, the coefficients of (4) 
represent the nine components of a 
tensor. 

The correspondence between the space 
about P 0 and that about F 0 ' is that of 
an affine transformation oi space, where 
— by definition — straight Lines are trans- 
formed into straight lines, and thus planes 
pass over into planes, and parallelism is 
retained. This is immediately evident 
in the ’vector form: A set. of parallel lines in the direction of i 
may be represented by means of a variable parameter A in the form 

r r, -f At, ....... (5) 

so that one straight line of the family corresponds to each r ( . If this 
relationship is substituted in (3), we obtain 

r'— r i +Xf+ (rci-At) grad • s~ r,+ r t grad • «■+ A{sf+ i grail * si. (6) 
This, however, is again the equation of a family of parallel lines with 
rj «* r t + r t grad - s, f — i + i grad * &\ . . (7) 

The transformation (4) may include still another part, which 
represents the rotation of the neighbouring apace as a rigid entity. 
In this case, the vector r is altered, as before, but no internal elastic 
forces arise, since there is no actual deformation of the volume under 
consideration. If we v,ish to find the forces attributable to changes in 
the distances between particles, we must deduct from (4) the part 

' * For abnpJidtjr, the s&bHoripts of the derivatives are now omitted. 
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concerned with pose rotation. This may be done in the following 

manner: 

On p. 140 it was shown that tqe velocity of say point of a rigid 

body rotating with angular velocity to is given by 

v = [tap']. . (8) 

Forming the curl, we get 

curl v — [V[w»']] = to • V#' — <*>V • r 

— to div r — to grad • r — 3<*> — i<s =■ 2to. . (9) 

Thus, for a given velocity field, curl » represents a rector equal to 
fiotce the angular velocity vector.* 

If a rigid body is thought of as rotating for a short time <U, it is 
found that the change in the radius vector f caused by this rotation is 

d mt r — [<ar]dt = ^[rot ~-dt,r\ = |[rot s, r], , (10) 

The difference between the vector r + d rot r and the vector r' as 
specified by (3) represents the change in v due to pure deformation, 
which we denote by <Z dct v. 

d det r=r~r~d cot r. ..... ( 11 ) 
From the deformation alone we then have the relation 

r" = r + a' def r — r - !- r grad * -s — £[euri s, r], . (12) 
or, in Cartesian form, 


(12a)- 

*Cc + »> + ‘Gi + S)' + ( 1+ ^)‘ j 

The coefficients, in (12a), now form a symmef.ric tensor, which may be 

* This is .still true it the whole body is subject, in addition, to a translation vr, for 
sinoe Vr is the same ior all particles, it drops out in forming the carl. 

y Actually, one should write curl ds, but according to the original assumptions all 
displacements are assumed to be small quantities. 


„ / 1 i ° M \ i i/3« , 3t>\ . ifdw . du\ 

* - 1 1 + nr + *U + &)» + % + 

,, 1 /3m . dv\ / dv\ ,/dv 3 w\ 

y = + si)* +{ 1 + r y )y + Hr, + ah 
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visualised by means of an ellipsoid, as at p. 37. This ellipsoid, which 
has the equation 


(> + £)-+ (*+!;>■+ 0 + %y+(M + s)* 
+ (n+Ii)-+Cs+ s )-- 1 


(13) 


may be called the ellipsoid of dilatation, or ellipsoid of strain. Its 
principal axes are called the principal axes M strain. As shown at 
p. 37, the direction of r" is that of the normal to the ellipsoid drawn 
through the end point of r. The two vectors have the same direction 
for the ends of the principal axes. This means that only a segment 
having this direction maintains its original direction. Only those seg- 
ments lying in the directions of the axes of the strain ellipsoid experience 
pure extension without change of direction. 

The coefficients 



have a definite physical meaning: if we consider the termini of the 
unit vectors along the axes, for the undeformed body, then, correspond- 
ing to the vector i. we have, by equations (12a), 

,, ( i i 3m\ • , i /3m , 3«\ . , , (dw . du\ , ..... 

,, -(l + 5 ;). + i(5 J + 5 i )7 + i(j; + 8 -)*. <U) 
and for the vector j, 

r ” = l: (if + £) * + ( 3 + r y ) J +l \iz + if) k - (15) 

Neglecting ail terms which may be considered small according to the 
s,a?.”jXsptions made above, we have 


[£," | - j»| Vr-j'z- 1 _ 3m 

i i | 1 ’ ' 3*’ 

i rj’ j - \J | 3 V 

I/I 3/ 


(16! 

( 17 ) 


This means that the quantities 


3m dv dw 
dz’ dy ’ dt 



MECHANICS 


[Chap. 


give the relative changes in length of segments lying in the x-, y-, and 
^-directions respectively. 

In the undeformed state of the body, the vectors i and j are per- 
pendicular to each other; after deformation, they form an angle which 
we may designate by w/2 — since it differs but little from tt/2. Then, 

• • < 18 > 


if we use the values of rj' and j rj' from equations (14) and (15), and 
if we neglect all products of the derivatives, we obtain 

*>*»«. V (!*+£)•- • • • • < 19 > 

Thus the tensor component signifies the change in angle, between 
two segments having the directions of the x- and y- axes, resulting 
from the deformation. Similar results hold for the other components. 
K the axes have the directions of the principal axes of the tensor 
ellipsoid, then the terms T iS , T 23 , T ai are absent from the co-ordinate 
representation. This means that the axes of the strain ellipsoid remain 
perpendicular to each other, which we already inferred above from 
the fact that segments in these directions suffer pure contraction or 
expansion. 

To calculate the change in volume per unit volume (the so-called 
“ cubical dilatation *’), we need only find the difference in the volume 
of the parallelepiped formed by the vectors i,j, ft, before and after 
deformation, i,e. the difference [rj' rj' r*"] — ■ 1. On writing out 
the determinant we see immediately that this expression, to a first 
approximation as before, gives 


_ 3« , 8® , dw 
~ dx dy dp 


— di vs. 


• • ( 20 ) 


It is more convenient, for what follows, to consider the strains them- 
selves, i.e. the components of the vector r" — r. These are obtained 
at once from equations (1 2a) (p, 164) by bringing the components of r 
over to the left side. Introduce the following notation for the deri- 
vatives and their combinations, the physical meaning of which was 
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We then may sprite, by (12a), 

*" “ * = «u® + &nV + } 

y" - y « Ki® + %»y + fo»* * • • • (22) 

— * = Hi* + Htf + ess* 1 

The tensor ellipsoid corresponding to this symmetric vector function 
has the same axial directions as the ellipsoid of strain (13) (p. 166). 
This may be shown readily by calculation, but also can be seen at 
once in this way: if fcha vector r" is to have the direction of r for the 
axis, then this must also be true for the difference vector r" — r. 
The mixed Sensor components vanish when referred to the principal 
axes. Denoting by accents the co-ordinates referred to this system, 
we have remaining. 


The quantities <%, % e% are called the principal extensions. 

Bx, 54. Compute the relative change m length of a segment whose direction 
cosines, referred to the axes of the ellipsoid of dilatation, are a,, ot^, « s . 

2. State of Stress of a Body under Strain. 

If wa consider an element of volume dr, of mivss dm. in Ihe interior 
of a strained elastic body, we recognise that two kinds of force are 
operating. First, there are forces like that of gravity, applied to all 
the individual constituents, whether they be in the interior of the 
body or on the bounding surfaces. If the body is being accelerated, 
the "inertial resistance ” 


is also a force of this type. But, in addition to forces of this kind, there 
are forces operating on the surface particles which are due to the 
presence cf the contiguous particles. These correspond to the springs 
in the mode, described on p. 161 . Since these forces operate between 
adjacent particles on either side of the bounding surface, they are not 
proportional to the volume of the element, but to the area of its sur- 
face. The value of the force acting upon an element of surface, re- 
ferred to unit area, is called the mild stress, or simply the stress, P. 
The direction of this force need not by any means be normal to the 
surface — in general it is composed of a normal stress (pressure or 
tension) and a tangential (or shearing) stress. We can show, however, 
that the stress P on any surface element may be found in terms of 
the normal direction, provided that the stresses are known at this 
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point for three mutually perpendicular directions, which we choose as 
axes of a rectangular coordinate system. Thus, we take the volume 
element to be a small tetrahedron, of which three faces — dS x , dS v , dS, 
— lie in the co-ordinate planes, while the fourth — dS — has the outward 
directed normal n (fig. 2). Let the stress on dS be P. In addition, call 
the stress acting an the surface whose normal is along the jc-axis P v 
and the stresses on the faces perpendicular to the y- and 2 -axes P 2 
and P„ respectively. The vectors belonging to the faces which lie in 
the co-ordinate planes have tiro directions of 
the negative x-, y- and s-axes. The stresses 
Pj, P 8 , P 3 therefore denote the Besses / n 

exerted by the laces of the tetrahedron on j 

the adjacent surface elements, whose normals j k ^ x \ 

are along the positive axes. According to - J >, -V-_ 

Newton’s Third Law, the stresses exerted /id 
by the neighbouring surfaces on the tetra- 
hedron are then — — P gJ — P 3 . If we j* 

now formulate the conditions of equilibrium Fig . 2 

for our infinitely small tretahedron, wo need 

consider only the forces proportional to the areas, since the forces pro- 
portional to the volume vanish to a higher order as we diminish the 
dimensions of the element. Then the first condition of equilibrium 
— the vanishing of the resultant force—is 

P(B-PA-'-P 3 dS„-PA = 0. . . (24) 

Now ndS ** 4dS « + jdS v + kdS x (25) 

that is, 

dS x -- dS{ni) --= dS cos a; dS v •— dS cos /J; dS t ■— dS cosy, (26) 
whence P ~ P, cos a -f P 2 cos S -f- P 3 cosy. . . (27) 

Denoting the s-component of P x by P u , that of P 2 by P 21 , the 
y-eomponent of P x by P 12 , &c., we obtain the component represen- 
tation 

P. x — cos a -j- jP 21 cos /? P S1 cosy 

P v — P 12 cos a Pj 2 cos /J 4- P 32 cos y i . . (28) 

P t — P n cos a -f P 23 cos )3 + P 33 cos y I 

This equation gives every normal direction it a corresponding stress 
vector P. 

The meaning of the coefficients P ik may be seen by considering a 
cube of edge unity, whose faces are parallel to the co-ordinate planes. 
Then, for example, for one of the faces parallel to the yz-plane, 
cos a — 1, cos j3 = cosy — 0, 
i.e. P,= P„ t ‘P v = A,, P a =Pi a , 
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so that the normal component P B is represented by Pn, and the two 
components lying in the surface are represented by P 12 and P 1S . Hera, 
then, the coefficients having two eq^al subscripts give the normal 
pressures or tensions, while those with mixed subscripts represent the 
tangential stresses.. 

The coefficients again represent the components of a tensor, since 
the connexion between F and n must, of course, be independent of 
the particular co-ordinate system in which the vectors are resolved, 
and must always lead from a given vector n to the same vector F. 
The tensor defined by the quantities P<* is symmetric, as will be shown 
in the following section. It may therefore be represented again by an 
ellipsoid, called the ellipsoid of stress. If the co-ordinate axes are taken 
along the principal axes of the ellipsoid, the mixed tensor components 
drop out, and we have only the three coefficients P v P 2 , Pg, corre- 
sponding to the diagonal terms of the matrix of coefficients. These 
three quantities are known as the principal stresses. The relationship 
between the components of the stress acting on a surface 'element 
normal to n and the principal stresses is very simple when expressed 
in this system of co-ordinates. Denoting the quantities referred to 
this system by primes, we have 

Ps- — P 1 cos a') 

P,,~P 2 cos/H ....... (29) 

P«- = P 3 cos*/] 

3. Conditions of Equilibrium of an Elastic Body. 

Consider the equilibrium of the forces applied to a finite volume of 
an elastic body. If we denote the resultant of the forces proportional 
to the volume — the body forces — acting on unit volume by g, and 
apply the fundamental equations of the mechanics of systems of 


particles, we obtain 

f gdr -f- (f) FdS — 0. ..... (30) 

f[rg]dT+f[r*]dS~0 (31) 


The surface integral is to be taken over the bounding surface of the 
volume under consideration. In adding the stresses acting upon the 
several volume elements, the internal stresses on the boundary be- 
tween two elements cancel out, since these stresses are applied in 
opposite directions on the two sides of such a surface of separation. 

If we take the ^-component of equation (30) we have on the left 

Jg x dr+ j)P x iS= J g x dr+j) (P u cos a + P^ cos )3 -f P a cos y) dS. (32) 

The surface integral may be brought into the form appearing in Gauss’s 
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Theorem and then transformed into a volume integral If we introduce 
another vector P x : 

P 1 ==P u *+P 2 J+P 31 £. .... (33) 

We shall show, below, that the tensor P ik is symmetric, so that Pj is 
identical with P x . Using this new vector, 

(f P x dS — (j) (P x n)dS — <j) (P^S) = j div P x dr. (34) 

Since the condition of equilibrium must hold for every volume, we 
Lace, writing the analogous equations for the y- and z-eompoaenbs: 

. ,. rg . 3P U . SPj^ 3P„ n a 

y x + divPj « g a + - ? -f + = 0 j 

9* + divP, - g, H- ^ 4- -|f « & [ > (85) 

g, + divP 3 = g, -f -f -4- = 0 j 

Again, the awjom&oaeat of the equation of moments becomes 

f(yg* — + $ (yP B —*p t )dS—o. . . (36) 


Just as before, we may w*rite 

j){yP z — zP^ydS— $ (?/P« — sPjjJdS -- / (div.yf’g — divsP~)dr. (37) 

Now v 

divyPg — Vdivfsrf (g*adp) — ydivP 8 -fp s j = «divP 3 +P aa , (38) 

bo that the ay-component of the equation of moments becomes 

y(g t 4- divP g ) — z(g„ + divf*) — P 32 + P m = 0 (39) 

But, according to the hist condition of equilibrium, the bracketed 
expressions are zero, and so we arrive at the proof of the symmetry of 
the stress tensor: p „ , ,, v 


Similarly, from the other two component equation? of (31), 

P-is “ P&il Pa l ~ Pisr ■ • ' * • ( 43 f ) 

In addition to the conditions for equilibrium of internal forces *«ve have 
those for the surface forces. If the stresses P* on the surface elements, 
caused by the external forces, are known, then equations (28) (p. 168) 
hold again, and we have 

Fj =* P n eos a -j- P 2 i cos,S + P %x cosy] 

Pv — Pi* cosa + P w cos jS + P ss cos y [ . . . (41; 

P/ — P x9 008 a + P a8 cos /8 + Pgg cosy)' 
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4. Relations between the Strain Tensor and the Stress Tensor. 

The conditions of equilibrium (35) are not yet sufficient, for the 
answer to the question “ What equilibrium state of strain results whan 
given external forces are applied?” We still lack the principal thing— 
the connexion between the stress and the strain. Since each consists 
of six components, we must expect, in the most general case, that each 
component of the strain tensor is a function of ail the components of 
the stress tensor. Assuming a linear relationship, this leads to a system 
of equations with thirty-six coefficients. This number reduces very 
considerably, according to the symmetry properties of the substance 
under consideration. If we consider only isotropic substances (he. not 
crystals), then, as we shall see, two constants are sufficient. For an 
isotropic substance, the axes of the stress ellipsoid must coincide with 
those of the strain ellipsoid, for the former give the directions for 
which the stresses are normal to the surface'; sad for an isotropic 
body, only contractions or extensions, but not lateral displacements, 
can occur in these directions. Now, according to § 1 (p. 165), this is 
the essential property of the axes of the strain ellipsoid. We shall 
therefore express the connexion betweeh the stress and strain tensors 
in this uniquely indicated system of co-ordinates. The foundation 
of the ao-calied classical theory of elasticity is Koohe’s Law, which states 
that the norms? stress is proportional to the elongation per amt length 
(** at tensio, sic vis '*). It is understood that we limit ourselves to small 
strains. fw—accordir.g to experience — Hooke's Law is ne longer 
valid for larger deformations. If we write this law for the axis 1 in 
the form P% «= 8%, than S is the linear modulus cl elasticity, or Yctmfs 
modulus, for the material >& question. We .know from experience that 
a- body undergoes & contraction. of its transverse sections when stretched 
in the direction of its length. The ratio of the transverse ocsntracrioa— 
i.a. the change in length per unit length of a transverse segment- -to 
the change in length per unit length of the body jb. according: to ex- 
periment, constant within the limits whore Hooke's Law holds. This 
ratio is called Poisson’s MaUo <r. That is, if stresses P t , Pj, P* act m the 
directions of the axes, then the tension P 3 oauses a ctmUmdion along she 
axis 1. This relative oontoaoti n is given by ne s , at* ' since P„ fi t it 
amounts to (ojE)P^. Similarly, the tension P t causes a ©ontrctotiba of 
amount (c/P)P s . The total relative change in length ia thus given by 

p (P > * | 

(v.+p.8-^.{j , r- r p;(0.+iv+^; s. m 
jF.-'dVt AS ” -;7 pv-tf; A+Pi+Aj ; 
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It remains only to transform these equations, derived in this way for 
the special co-ordinate system of the principal axes of the ellipsoids, 
to an arbitrary co-ordinate system, since, in general, the principal 
axes for a strained body change in direction from place to place, 
and thus are not suited to describe what occurs. Let the axis 
1 form an angle (I, x) with the x-axis, an angle (1, y) with 
the y- axis, and an angle (1, z) with the 2-axis, and let the angles 
between the axes 2 and 3 and the new axes be denoted in similar 
manner. Further, let us characterize the principal axes 1, 2, 3 by the 
unit vectors c v c 8 , c 3 , the new axes by the unit vectors i.j, k. Then 

i = c x cos(l, x) -f- c a cos(2, x) + c 8 cos(3, a;} | 

j = c x cos (1, y) + c 2 cos (2, y) + c 3 cos (3, y) j . (43) 

k — c t cos(l, z) -j- c a cos (2, z) 4- c s cos (3, z) ) 

By the rule for transformation of tensor components proved on p, 34 

we obtain 

e , cos 2 (l, x) -f cos 2 (2, x) -j- e 3 cos 2 (3, a;) 
h cos 2 (l ,'y) + e 2 cos 2 (2, y) + c a cos a (3, y) 

Ct cos 2 (l, z) + <? a cos 2 (2, z) + e 3 cos 2 (3, z) 
e n — ej cos(l, x) cos(l, y) + h cos(2, x) cos(2, y) 

4~ ^008(3,*) cos (3, y) )■ <**) 
e sst ~ h cos (I, y) cos(l, z) -f e 2 cos(2, y) cos(2, z) 
b e 3 cos (3, y) cos (3, z) 

^3 — ' Cj cos(l, z) cos(l, x) -f e 2 cos (2, z) cos(2, x) 

-f- 1% cos (3, z) cos (3, a;) . 

Jt is evident from these expressions that the sum 
<Su + e ga + 633 = div 5 

is equal to 4- e a + independent of the co-ordinate system. Such 
quantities are called invariants of a tensor. A second invariant which 
will be useful later (cf. p. 176) is the expression 

hihst + ha e 33 + CgsCu — («w a + + %i 2 ) — h e s + hh + e z e i- (45) 

The same transformation formulse hold for the components of the 
stress tensor P ik . Thus, expressing e v eg, ^ in (44) in terms cf P v 
P 2 , P 8 , using equations (42), and also the relationships 

P 1 + P a + P 8 = P 11 + P aa + P 33 

and cos 2 (l, x) + cos®(2, x) cos 2 (3, ®) = 1, 

cos(l, x) cos (1, y) + cos (2, x) cos(2, y) 

-)- cos (3, a;) cos (3, y) — 0, &c.. 


hi — 
e 22 — 
633 — 

hz — 

hi — 
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we obtain 

«u = ^ {/>*- ^ (P U+ P 22 +P 8S )} c li ^ 2 S^plp\ 

^“T{ p »-"r^( p “ +p » +p “)} e ^ 2 ^p )p ^\- w 

*83 ~ |-P S3 jqp^CPuH"^! n~V P Ss)} C 81 ~ ' ' P SI j 


The second triad of equations, which is absent if the co-ordinate 
system is that of the principal axes, has the following meaning: Con- 
sider again a cube whose edge is unity, 
and whose faces are parallel to the co- 
ordinate plane®. is the force acting on 
one of the faces normal to the re-axis; 
the opposite face is assumed to be held 
fixed. Then P 12 is the tangential com- 
~*p ponent, which is in the direction of the 
n y-axis. The change in the angle between 
the a;-axis and the ^-axis is represented 
by e 12 (cf. fig. 3). According to the first of 
the equation® under discussion, these two 
quantities are proportional to one another. The factor of pro- 
portionality 



E 

2(1 + a) 


(47) 


is called the shear modulus, and is usually denoted by p or ». This 
quantity, however, is not a new and independent constant of the 
material, but is determined by the values of E and of a, as in (47). 

Similarly, the modulus of compression, or bulk modulus, k, or its 
reciprocal, the compressibility i< — are already determined if E and o 
are known. We define k as the change in volume per unit volume 
under the influence of unit normal pressure applied equally from all 
sides. According to this definition we have [cf. equation (20), (p. 166)] 


YLzlX *11 ~b *22 + ^38 

-VP' P 


(48) 


Putting in the values of e ik from (46), we have 


*— l —gp~( P U+Psi+ P 3s) 


3(1 - 2a) 
E ‘ 


. (49) 


Since the application of tension to the surface must cause an increase 
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in volume, a cannot be greater than 1/2. Were a — 1/2, the substance 
would be incompressible. 

Solving the equations (46) for the components of the stress tensor, 
we obtain 


Pu = 5^2; (^ f^+^3)}, ] 

P ™ = rhr { Cs2+ 1^35^“ 4 % 4 fl#s) } * } < E0 ) 

p 33.= fx^ r.r^ (c u ^ «8»)}> p a * 2(i5 —) %* } 


If we now substitute these values of the stresses in equa tions (35) 
(p. 170), and if w»4gaia introduce the values of s <4 from equations (21) 
(p. 166), we obtain a system of differential equations containing only 
the displacements «, v, u> and their derivatives. First, the <e~ com- 
ponent becomes 


B j 9 3*« , 2a /?*« , 3% , 

9x 4 2(1 +“ ff ) ‘ 1 — 2c/ \3z* 4 3*3 y 4 


dxihj) 


E ]8% , J>% , JV_ , 3*m 

’ 2(1 -(• a) Uy 8 dxdg~ r dxdz 1 §ri 


This may bo written 

JS I 3 idu , 3t> 3«s\\ . 

9 * 4 2(1 +1?) i d 1 1- 2<r dx {dz dy 4 S'* ) J • “ w * 

Similarly, 

, S j . , 1 3 /8« . 9® , 3w>\"< . /K1 ,. 

* 4 2(1 + o) l 4 F-2a dy Wx 4 Ty ^ di } I " {51 } 


3 ° 4 20TT) ! Aw 4 Si (ss 4 If 4 f ?) } ~ <w "> 


In vector form, this system becomes 

t ~r { a* + fzr% ^ ad iw 8 j - c - * ( ?2 > 

This is the general system of equations ter the distarriou of m isotropic, 
elastic sobstauce. In. addition, we have the surface boundary con- 
ditions, which may ha given in tire form of adorn ation coopeming the 
displacements or tri# distributioft of stresses on thp surface. 
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5. Energy of Elastically Deformed Bodies; Elastic FctenF&l. 

Work is required to bend a bar elastically. This work remains 
stored in the bar in the form of potential energy as long as the defor- 
mation persists. It may fos drawn upon by letting the bar go, where- 
upon portions of the bar attain kinetic energy and go beyond the 
equilibrium position. The energy of elastic deformation is potential 
energy of the forces operating between the particles and is distributed 
uniformly throughout the body. It has its greatest value at places 
where the deformation is greatest. In view of the continuous distri- 
bution, we look'for an expression for the energy density — the limit of 
the ratio energy /volume of an elementary volume.* 

To obtain an expression for the energy density V, we start with 
a cube of edge equal to unity. In this case the energy content of the 
cube is equal to the energy' density. Let the edges of im cube be in 
the directions of the principal axes of strain. As h result, the cube 
suffers only extension or compression, but not shear. In its deformed 
state, the face perpendicular to axis 1 hafc an area of (I 4- %}{1 + e»), 
and so the force acting on it is P^l -j- e sl )(l -f- e 3 ). If now the edge 
parallel to 1, whose deformed, length is (1 -f- e^, ware streichcsd &» 
amount. Se,. the stress 1\ would do an amount of work equal to 
SA, - .?,(} -f ^)(l + % )he x . 

(The increase ia stress due to the added deformation would make a 
contribution which would be of only second order.) The above amount 
of work represents the increase in potential energy due to the assumed 
change in rkformafciou. Further, the quantities and e A in the paren- 
theses may be neglected in comparison with unity, so that the increase 
in potential energy duo to the increase in length of the axis 1 will 
amount bo 

8 h\ --= |y 84 = P^ej, or P x = „ . (53) 

Analogous expressions result for the other axes. Substituting the 
valuer of P lt P 9 , P 3 from (42) or (60), we obtain 



* It ir.uctt bo iwMttQbswa, however, the* t.eoor’iK® fc? the discussion os p. 3 the 
eiemant of vo'smw. mat vat- be made too taxM. 
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It is easy to see that these equations are satisfied by setting 

U(e v e f: , e s ) = JW + ej + ej) + ~ ^ (e 1 -f.e 2 + «s) 2 ] . (55) 

This expression, which gives the energy content as a function of the 
principal strains, presupposes the use of the principal axes as co- 
ordinate axes to be used at each point, and this system will change 
from one point to another. For this reason, it would be desirable to 
transform the relation to a form that holds for the whole body. To do 
this, we do not have to use the general transformation, since the quan- 
tities involved here may be expressed readily in. terms of invariants of 
the tensor. The second parenthesis is such an invariant as it stands. 
Further, because of (45), 

e* + ca 2 + e 3 3 — («i + «2 + es) 2 — 2 («ie 2 + + e &) 

— ( fi n + e 22 + e zz'f 

— 2[e 11 e 8a + <^33 + e 33 e ll ~ (e^ 2 + e 23 s -j- e 31 2 )]. 
Then U will be expressed in terms of the e ik as follows : 

( e «c) — 2(Y.f o) [lZT2ff (eu e * 2 e ' i3 ^ 2 + e 22 e 33 + e u ) 

+ W + % a + e 31 2 )] (56) 


Comparison with (50) shows again that, in general, 


P ik 


<ru 

de ilc 


(57) 


Finally, the deformations in (56) may be expressed in terms of the 
stresses, giving a quite analogous expression which, however, will not 
be written down. The quantity U(e (k ) is rightly called the elastic 
potential, since the stresses are obtainable from it by differentiation. 

6. Elementary Treatment of the Bending of a Cantilever Beam. 

The exact solution of an elastcstatic problem requires the in- 
tegration of equation (52), subject to the boundary conditions a r ply- 
ing in the special case considered. In general, this procedure leans to 
extremely complicated calculations. For this reason, approximate 
solutions have been devised for many technically important problems. 
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Certain simplifications are made in the initial assumptions, and the 
results are sufficiently accurate for many purposes. As an example 
of such a simplified treatment, we give J. Bernoulli’s theory of the 
bending of a beam of uniform cross-section. The Bernoulli theory is 
based upon the following assumptions: Imagine a certain point- 
say the centre of gravity — to be marked in every cross- section. The 
resulting straight line, which we call the axis of the beam, will be 
bent into a curve which lies in a vertical plane, provided that the 
bending is caused only by weights or 
other vertical forces. Further, imagine 
the beam divided into a number of thin, 
horizontal layers, each made up of fila- 
ments. During the bending, there will 
be one layer in the middle, the fibres in 
which are not changed in length, and 
which is therefore called the neutral sur- 
face. Filaments lying below the neutral 
surface are compressed, those above are 
stretched (of. fig. 4). The flexural rigidity 
or stiffness of the beam is assumed to 
be conditioned solely by these changes 
in length. A further assumption of the 
Bernoulli theory is that plane sections 
normal to the axis of the beam remain 
plane and perpendicular to the axis after 
flexure. With these assumptions, it is not difficult to calculate the elastic 
stresses acting upon any particular section. If we consider two neigh- 
bouring cross-sections, distant dx from one another, then these sections 
will form an angle d<f> with each other after bending. Measure the x-co- 
ordinate towards the right from the fixed end of the beam, and take 
the z-axis vertically upward from the neutral fibre, at the fixed end. 
The y-axis is horizontal, and directed across the beam from the nearer 
face. If p is the radius of curvature of the neutral fibre, then by fig. 4, 
a layer which is distant z from the neutral surface experiences a relative 
change in length amounting to 



( p -j- z)d<f >. — pd<f> <&£ 

dx dx p ’ 

since d4>jdx is the curvature 1/p. 


(58) 


The force acting upon an element dzdy of the cross-section is* 
according to the definition of Young’s modulus E, 


dF ~—dz dy. 
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The total force acting upon the entire cross-section is then 

F J Jz dzdy ( 59 ) 

This resultant must vanish, since neither a change in cross-section 
nor a longitudinal displacement of Any section ran take place In pure 
bending. This means that the neutral surface always passes through 
the centre of gravity of the orofw-section, for this is the origin for which 
Jf z dzdy vanishes. On the other hand, there exists a resultant moment 
‘of the elastic forces which tends to turn the cross-section about tho 
line of intersection of the neutral surface with this section. In fig. 4, 
this moment is in. the direction of the negative y-axia. Its magnitude is 

jiff j fs dF « ~ ffr dzdy = ~I: . . (60) 

The geometric quantity Jj's Pdzdy is called the moment of inertia I 
of the cross-section. 

Let the cross-section bo rectangular, as assumed in fig. 4; if we 
call the breadth h and the depth o, shea the integration is to be 
extended from y ■■= 0 to y = b and from z —c/2 tc t ■— -f-o/2. 

The moment! of the elastic forces must balance the moment of the 
external forces. If the bending is caused by a load Q (in force units) 
applied at the free end of the beam, where * « a, then the moment d 
this load at the point x is, since the portion a — x may bo considered 
a rigid lever arm, 

j M' i — Q (« — a?). (61) 

Tho direction of this moment vector is that of the positive y-axis. 
The condition for equilibrium cf a cross-section distant x from, the 
fixed end is thus 

— Q(a~ x). (62) 

P 

We can now readily obtain the equation of fchs neutral axis after 
flexure. Since the curse departs hut little from the horizontal direc- 
tion, we may write 

1 d‘S 

f> && 

at ail points.* Thus we obtain the equation 



* The aegaiive sign cotaea how the f»« that ihc unrve k, concave toward the 

negative #-a.djs. 
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Integrating twice, there results 

Q /ax* aA . 



m 


Since z and also izjdx vanish at x 0, both constants 
of integration are zero, and we have for the depres- 
sion of the end of the beam (at as -■ a), 

. Q_ a& 

^ ~ 31 3' • • • 


(65) 


The deformation of the cross-section, which was 
not taken into account by Bernoulli, may also be 
given without difficulty, provided the cross-section is 
initially rectangidar. Since a transverse contraction 
takes place above the neutral surface, where the fibres 
are stretched, and a transverse expansion,, occurs below 
this surface, the, originally vertical sides of the rectangle 
must converge toward the top, so that, the section now 
has the form of a sector of a circular annulus (fig. 5). 
Let p’ be the radius of curvature of the line of inter- 
section of the neutral surface with this cross-section, 
and call the radius of curvature of the neutral fibre 
p, as before. Following the same method as above, we find that 
the relative transverse extension (in reality, contraction) of a layer 
which is a distance z above the neutral surface is given by 

— s) d<j> — p'd(j> d<j> ^ z 

dy 'i “ % ~~ p 


Mow the longitudinal dilatation at the same point was found to be 
+s/p, by (58). According to the definition of Poisson’s Ratio, we 
then have 

a — — — , or p = — . . (66) 

P a 

Thus the originally plane layer at height z is bent into the form of 
an auticlastic surface whose two radii of curvature are in the ratio 
a. This fact in the basis of an elegant and simple method of determin- 
ing cr (see Ex. 56). 

Esc. 55. Calculate the defieetion of a tube of length a, radii and r r supported 
at the ends and carrying a concentrated load at the mid-point Starr: In order to 
reduce this to the above cane, imagine the middle of the tube clamped fast, and 
an upward force equal to half the load applied at each end. 

Ex. 66. A plate of rectangular section, supported along its central lino, is bent 
by load;? applied at tbs ends. A plane parallel plate of glass is laid upon the bent 
plate at the centre, and the system is illuminated boa above by monochromatic 
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light. The Interference fringes (p. 410), appearing at the thin layer between the 
plates, furnish a contour map of the surface resulting from the bending. By 
approximate representation of tbe surface in the neighbourhood of the point 
of contact, show that the contour lines are hyperbolas, the angle between the 
asymptotes being related to a in a simple way. Hint : Observe that for the point 
of eontaot 


px 2 p’ dy * f 




and that p and p' may be considered constant in the neighbourhood of this point. 

Ex, 57. A wire of leiigth l end radius of cross-section a is clamped at one end, 
the othe»- end being twisted through an angle 9. Compute the resulting moment} 
of torsion, which will be proportional to 9, by imagining the wire divided into 
thin, concentric hollow cylinders, and these in turn subdivided into narrow rect- 
angular elements that suffer pure shear when the wire is twisted. 


7. Waves ia Unbounded Plastic Media (Seismic Waves). Longitudinal 
Waves in Bars. 

The fundamental equation (52) (p. 174) holds for elastoirinetics 
as well as for elastostatics. We wish to apply it to the kinetic case 
and study the disturbances arising in an unbounded medium sub- 
jected to no external body forces. In this connexion, the inertial 
forces given by d’Alembert’s principle are 



per unit volume, p being the density. If all other body forces are 
absent, equation (52) becomes 

? »“5iTTT){^ + r=li graddirs }' ' m 

From among the many integrals of this second-order partial differen- 
tial equation, we select in the first instance the simple special case 
where the displacement vector a depends only upon one co-ordinate 
---Bay x — and thus derivatives with respect to y and z vanish. In 
co-ordinate form, we then have 

o 2 « E fd 2 u 1 3 2 w) E(l—o) d*u\ 

P dP 2(l + a)t0z 2+ l— 2a 3z 2 J~~(l+a)(l — 2 ^ 02 ? 

_ E 

p dt e ~ 2(l+o) Sx» 

d 2 w E d’-w 

P W ~ 2(1+ a) dx* 


)■ ( 68 ) 
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These are, however, equations of plane waves propagated in the direc- 
tion of the axis, as comparison with the wave equation (47) (p, 82) 
shows. The coefficient of dhijdfi gives the square of the reciprocal of 
the wave (phase) velocity, if the coefficient of dht/d a? is reduced to 
unity. It is evident from (68) that the wave velocity for a displace- 
ment along the x axis — i.e. in the direction of propagation— differs 
from the velocity for displacements normal to this direction. Waves 
of the first kind are called longitudinal — those of the second kind, 
transverse. The wave velocities are 

^ = V pd+ o) (I - 2a)’ • “ • • (69) 


W* - 2p{l (70) 

There is another less formal way of arriving at the classification into longi- 
tudinal and transverse waves, which hag more direct physical significance. II 
we replace As by grad divs — curl ourl s, according to the familiar mie of vector 
analysis (p. 40), then (87) becomes 

' " r> 

We now divide s into two parts, wiiiob are to satisfy the conditions: 

curl Sj — 0, div r, d= 0 ( ^ 

div s t — 0, curl + 0 / ’ f * ’ * ' * 


This means that represents a compression**! wave, since, by equation (20) 
(p. 166), tne volume dilatation is measured by 




Bit , 8v 
Bx h ' By 


8w 

a*’ 


At the ft 
ourl 6-j = 
has only 


e time, the individual volume elements experience so rotation, since 
. On account of the postulated independence of y and z, this part 
^-component; 


0»«, _ E( 1 - a) 

9 dfl = (1 + a) ( 1 - 2a) Bx *' ' " ’ 


Coraparisostf with (68) shows that the compressional wave is identical with the longi- 
tudinal wave. On the other hand, compression is absent for s s , since div s 3 — 0. 
As a consequence of the vanishing of the divergence, and of the vanishing of BvJBy 
and Bw s /Bz separately, BuJBx also vanishes. The non-compressiona! wave of 
shear, as we may call this part, has therefore no component in the direction of 
propagation, and is thus transverse. The actual proof of the identity of this 
wave with the transverse waves of (68) is obtained by writing the equation^ for 
the y- and z-components of (67'), which yield 


8*v t r _ E 
P 31* “ 2(1 + a) 8®* 




These are the equations (68) obtained above for the transverse waves. 


• (73) 
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The ratio of the velocities of propagation of compreaafaraal and 
shearing waves is 

i'r/r^- <"> 

The value or--- 1/4, which holds for many substances, makes this 
ratio equal to ye-. In earthquakes, both types of wave are observed; 
observation of the ratio of the velocities permits the calculation of a 
for that part of the interior of the earth in which the waves travel, 
The value thus obtained is 0*29. 

Apart from these waves in the body of the substance, there are 
longitudinal as well as transverse surface waves which follow the sur- 
face of separation of two media having different elastic properties. 
These waves correspond to the electrical surface waves treated on 
p. 345, and are called Rayleigh waves . after their discoverer. 

The propagation of waves in an unlimited medium is to be dis- 
tinguished from propagation along a bar whose transverse dimensions 
are small compared to its length. If we imagine the unbounded medium 
divided into such bars, whose axes are along the direction of propa- 
gation, we see that the boundary conditions are different for the single 
isolated bar and for a bar imagined out from an unlimited medium. 
In the former case the transverse contraction can take place without 
hindrance, while the surface of the bar in the second case is elastically 
bound to the surroundings, and so a transverse contraction causes 
counter stresses.. For this reason it is much easier to treat longitudinal 
waves in thin bars. We need not go back to tire general equations, 
but ,can obtain the differential equation in the following simple way. 
Consider an element of the bar included between two cross-sections 
distant ax apart. Call the extension on the left side of this element 



and that on the right side 



Then, if the area of the section Is A, and if Young’s modulus is %, the 
resultant force acting on the element is 



and Newton’s Second Law gives 
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The velocity of propagation of longitudinal (compressions!) waves 
in a bar is thus 

»=VI- ■» 

7. Transverse Vibration of Stretched Strings and Membranes. 

In many cases it proves expedient to start, not with the general 
equations — modifying them to fit the special problem in hand — 
but to proceed as we did in treating longitudinal waves in bars; viz. 
to derive the equations immediately from the fundamental principles 
of the theory of elasticity, thereby taking advantage of the simplifying 
conditions of the problem. The transverse vibrations of stretched 
strings and membranes form a case in point. These vibrations are not 
only of great physical importance, in acoustics, but their discussion 
leads to mathematical operations applicable in the most diverse 
branches of physics, especially in atomic physics. 

By a string we mean an elastic body whose cross-section is so 
small compared to its length, that it offers no appreciable resistance 



to bending. The fact that the stiffness of an elastic bar diminishes as 
the cro3s-section decreases follows from the results of § 6 (p. 176). As 
the cross-section is made smaller and smaller, we finally have only 
the fibres in the immediate neighbourhood of the central filament, 
and there are no longer any appreciable stresses opposing the bending. 
How is it possible, then, for a string to execute vibrations? The state 
of affairs is completely altered by the fact that the string is stretched 
between two fixed end points. Even in the eqyiiibrium position, 
there exists a stress P normal to the uniform cross-sections A, i.e. the 
tension P is always in the direction of the tangent to the central line 
of the string. Because of the complete flexibility of the string, trans- 
verse forces do not exist. If the string is now drawn aside from its 
position of rest, the resulting increase in length causes a» increase in 
tension. But even this increase in tension is not the true cause of the 
transverse vibrations; we shall see, below, that it is a small quantity 
of higher order than the true cause of the vibrations. The latter, is 
much more direct: in the strained state, the tangents to the string, 
drawn at the ends of an element dx, have slightly different directions. 
As a consequence, there is a resultant force in the direction normal to 
the rest position of the string (see fig. 6). If we consider only vibra- 
tions in one plane, and take the plane of vibration to be the xy -plane, 
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then the y-eomponent of the displacement; which was previously denoted 
by v, is simply y, since the rest position of the string is the as-axis. 
In the displaced position, a force APt{x) acts at the left side of an 
element dx, if we call the unit' tangent vector t. Similarly, a force 
AI J t(x -f- dx) acts at the right side. If a is the inclination to the hori- 
zontal, the components of the resultant are 

F x = AP cos(a + da) — AP cos a, J 
F y ~ AP sin (a + da) — AP sina. I 


We limit the displacements to small values, so that we may set 


cos a = 1 


sin a — a = tan a 


dx 


Also, the change in the tension P is small compared with the value of 
the tension itself. Neglecting all quantities of second order, the re- 
sultant in the transverse direction becomes 

'•""{(SLr(2).H f >■ m 

(In the longitudinal direction the change in. P would need to be taken 
into account, since only terms of second order appear here,) 

Using Newton’s Second Law, the equation of motion of transverse 
vibrations of the element becomes 


3 2 y p 3 2 y 

d-p d- 


(80) 


This equation has the familiar form of the wave equation (c£. p. 62). 
It is to be noted that the tension enters hero, but that the modulus 
of elasticity of the string does not appear. 

The solution 

Pl = a * m *— 

corresponding to a wave travelling in the direction of the positive 
x-axis, satisfies the differential equation, but not the boundary con- 
ditions, viz. y vanishes at every instant for x = 0 and for x — l, where 
I is the length of the string. Similarly, 

y a = < 7 ^+*^ 

is also a solution of the differential equation, and corresponds to 
a wave moving in the opposite direction, but it, too, fails to satisfy 
the end conditions. On account of the linearity of the differential 
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equation, the difference of these two particular solutions is also a solu- 
tion. Moreover, we caa give at least certain particular values of to 
for which the boundary conditions are satisfied. The difference is 

y = Vt-Vi = C*"' = 2 iCe^ sm(ax^j. (81) 

It is evident that the condition that y vanish for x — 0 is satisfied 
for every value of t. On the other hand, y — 0 f or a = l only when 
uiW p/P is an integral multiple of rr. If n is any positive integer, there 
is therefore a corresponding frequency v„, given by 

-"-IVl (82 > 

We may thus consider the vibration of the string to be a standing wave 
resulting from superposition of two oppositely directed waves of the 
same frequency v„. Application of the boundary conditions no longer 
permits all arbitrary frequencies, but only a discrete set. This set, 
it is true, contains infinitely many values, since n ranges from 1 to oo . 

This result may be obtained in a somewhat different way. On 
account of its fundamental importance, this method also will be ex- 
plained here. If, as usual, w© assume equation (80) to have a solution 
of the form 

y— M x)e? at , . (83) 


then ws have, after removing the time factor e*'" 1 , the following 
differential equation in * alone: 



dhu , £o 2 p _ 

-4 t w — 0. ... 

da? ^ P 

. . . (84) 

If we put 

'Z 

II- 

pajfti 

3 

. . . (85) 

this becomes 

g+A^=0. . . . 

. . . (86) 


This equation has a solution consistent with the boundary conditions 
only for certain values of the parameter A®, which are called charac- 
teristic values, or proper values. Taking, as a solution, 

w — sin Ax, (87) 

we see that A must be an integral multiple of irjt, in order that y vanish 
for x~l, i.e. 

A„ = 2 wv n ^/|==y, 


( 88 ) 
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and thus, as above, 



. . (89) 


The functions which correspond to the characteristic values, and which 
are solutions of the differential equation, and which — in addition — 
satisfy the boundary conditions, are called the characteristic functions, 
or proper functions, of the differential equation.* In the present case, 
the characteristic functions are the functions 



Since the real as well as the imaginary parts of equation (83) must 
satisfy the differential equation separately, a solution which satisfies 


the boundary conditions is given by 

y = a n cos 2w M sin-p^ ..... (90) 

as well as by y = b n nm2irvj sinpp (91) 

But the series 


y — S a n oos 2irv a t sin sin 2nv n t sin pp (92) 

is also a solution, since a sum of particular solutions also satisfies 
the equation. In the case of an infinite sum, it must also be provided 
that the series converges. The coefficients o„ and 6 a are determined 
by giving the initial displacement y 0 and initial velocity y 0 as func- 
tions of x: i.e. apart from the spatial boundary conditions to be ful- 
filled at every instant, certain initial conditions must also be satisfied. 
Now 

y 0 (x) = a n sin ”j-, ...... (93) 

y 0 ( x )~ £ 2Trv n b n sm~ (94) 

The determination of the coefficients follows the method developed 
for those of the Fourier series (p. 54). The right member of (93) is 
simply a Fourier series which is to represent the given function in the 
interval 0 to i. To determine the coefficient a m , for example, we mul- 
tiply both sides of the first equation by 



* The German expressions for “ characteristic value ” and “ characteristic func- 
tion ” are Eigenwert and Eigenfunktion. 
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and integrate from 0 to l. All terms on the right vanish, with the 
exception, of that containing 


l ’ 

which, by p. 55, gives the following value for a m : 


2 r l ... nirrx , 

| Jf Vai®) sm — dx (95) 

In similar manner, the second equation gives 

bm== sin^ dr. .... (96) 


The vibrations corresponding to the various frequencies are of dif- 
ferent strength, depending upon the initial conditions. The vibrations 
of tho string are transmitted to the ear by the intervening air, and — 
in so far as they lie within the audible range — are perceived as a tone 
whose pitch is given by the fundamental frequency v v and whose 
quality, or timbre, is determined by the amplitudes of the various 
harmonics, with frequencies v n which are multiples of v t . Thus equa- 
tions (95) and (96) are the mathematical expressions of the fact that 
a given string tone sounds different, according as it is produced by 
plucking or by striking the string. As an example of the determination 
of the coefficients, let us choose the struck string; this corresponds, 
say, to a pianoforte string. For t — 0, let y be zero everywhere. Let 
, y 0 be zero at all points except between (I — a)j 2 and (I + a)j% where 
it i3 to have the value v Q . Thus a is the width of the hammer. Then 
a m = 0 and 



-If the string is not struck in the middle, as assumed here, the coefficients 
will have other values. 

It is not difficult to treat the two-dimensional analogue — the 
vibrating membrane — in the same way. Here, again, let the mem- 
brane be so thin that it .possesses no appreciable stiffness. As in the 
case of the string, it is only because the membrane is fastened at the 
edges that restoring forces arise. Assume the bounding curve to be 
plane, its plane being the a^'-plane of our system of co-ordinates. The 
constant tension P acts upon every element of area in a section normal 
to the plane of the membrane. Consider an element dxdy in the dis- 
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placed position of the membrane. Then the tensions parallel to the 
yi-plane have a resultant 

F lu ~DP d ^dxdy, 

where D is the thickness of the membrane. The tensions in a direction 
normal to this have a resultant amounting to 


Altogether, we have a total force acting in the direction of the 2 -axis 
of amount 




Applying Newton’s Second Law, the equation of motion for the element 
under consideration becomes 


, . dH nD . , /dH , a**\ 
fVteie 8> - DPixi , (jp + jp ) 

3®z 3 2 z _ p ctz 

a? + 8ji» l 'f a?' * ■ 


(93) 


The spatial boundary condition to be satisfied in addition to the dif- 
ferential equation is that a — 0 along the bounding cur/e at all times. 
Generally, at the boundary, we could prescribe definite tmporaEy con- 
stant values pf z or of dzjdx, but not of both, as one might conclude from 
the degree of ‘the differential equation. It is easy to see that the prob- 
lem would be over-determined by specifying both s and dzjdx. Imagine 
the integration carried out numerically or graphically, piece by piece, 
in a strip parallel to the j’-asde. We begin with a surface element 
whose position end orientation are given, and join the next element 
to this, the orient*, lion being determined by the differential equation, 
which given the increase of dzjdx. If we proceed in this way, we shall 
nett necessarily encounter the boundary a second time; hence only 
the position or the orientation may be' specified for the bounding «mm>. 

We again assume that the solution is of fcho form 

z *» w (x. y)#*, \96) 

and obtain, just as in the case of the string. 


d i w , dhi> , . c 
dx? "* by 3 


0, iU 


,,, it 
'V p' 


( 100 ) 


In general, this equation, too, has a solution which satisfies the boun- 
dary conditions only for a discrete sequence -of characteristic values A*,*, 



VIII.] THE MECHANICS OF DEFORMABLE SOLIDS 189 


The nature of the corresponding characteristic functions depends 
upon the boundary curve, they may be complicated higher trans- 
cendental functions — e.g. Bessel functions, in the case of a circular 
boundary. Nevertheless, for given initial conditions, the coefficient 
of a series composed of characteristic functions may be determined 
as for the Fourier series. Thus it can be shown that all the charac- 
teristic functions of the equation Aw -|- Xhv — 0, which v anish on 
the boundary, have the property of orthogonality; i.e. in analogy to 
the va nishin g of the integrals 

X * . mrrx . mrx , 

sin -y~ am j- ax, 

we have in general 

J'UfndT = 0, (101) 

the integral being extended over the entire range of w, i.e. in our case, 
over the area of the membrane. We give at once the proof for the 
three-dimensional case: If f m and /„ are two different characteristic 
functions, then, by G-auss’s Theorem, 

(/*» grad /„ — /» grad/JdS =f (/ m A/„ ~f n &f n )dr, (102) 

which, by (100), is equal to 

/ {K? — K*)fmfndt (103) 

The surface integral is to be extended over the boundary of the region, 
the volume integral Gver the region itself. . But f m and /„ are always 
zero on the boundary, and so the left member of (102) is zero. Since 


and /\„ are in general different, we have 

f/,J n dr = 0 . (104) 

If we now write an arbitrary point function g(x, y, z) as the sum o' 
characteristic or proper functions /<, 

g(x,y,z)=--a 1 f i + a 2 f 2 +...a m f m +..., , (105) 


we may proceed to determine the coefficients exactly as with Fourier’s 
series. If we multiply both sides by f m and integrate over the entire 
bounded region (in the case of the membrane, over the entire area), 
the orthogonality makes all terms of the series on the right, except 
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vanish. The coefficients are thus 


a m 


fg{®, V, z)f m dr 

JfJdr 


(106) 


The characteristic functions are determined only to a constant factor. 
This multiplier is usually so chosen that the integral 


ffjdr 


has the value 1. In this case the characteristic or proper- functions 
are said to be normalized to unity. Consult the additional discussion 
onp. 703. 

Ex. 58. Give the solution of the differential equation for a reotangular mem- 
brane of dimensions a and b. [Try a solution of the form w(x, y ) = y(w;g(y).] 



CHAPTER IX 


The Mechanics of Liquids and Gases 
(Hydro- and Aero-mechanics) 

1. Equilibrium of Fluid Bodies (Hydrostatics). 

The liquid and gaseous states — especially the latter — are charac- 
terized by the fact that no resistance is offered to a change in shape 
which is unaccompanied by a change in volume. While liquids have 
extremely Bmall compressibilities (i.e. the forces opposing a change 
in volume are large), those of gases are very great. Moreover, gases 
tend to expand, filling the entire available volume. If we apply to 
fluids the theory developed in the last chapter for deformable bodies, 
then the absence of rigidity of form is expressed by the fact that the 
shearing stresses P 18 , P 23 , P 31 are zero for an ideal non-viscous fluid. 
(It is to be remembered that the term “ fluid ” includes both the 
liquid and gaseous states.) Actual fluids, it is true, are subject to 
tangential forces when in motion. If two layers slide over each other 
with a certain relative speed, they exert upon each other tangential 
forces proportional to the relative speed. Thus these forces vanish 
for a state of rest, so that it is not necessary to distinguish between 
viscous and non-viscous fluids in hydrostatics. The vanishing of the 
tangential (or shearing stresses) P 12 , P 23 , P 31 implies the vanishing of 
the terms in xy, yz and zx in the equation of the tensor ellipsoid, and 
this is true for every co-ordinate system. This means that every 
co-ordinate system is a system of principal axes, which is possible only 
if the ellipsoid is a sphere. Hence, in this case, all the normal stresses 
Pu, Pm P»s must be equal. We set them equal to — p* Thus a 
normal pressure p, of equal strength in all directions, acts upon any 
element of volume of a fluid. As a consequence, the general conditions 
of equilibrium of a deformable body become greatly simplified. The 
system of equations (35) (p. 170) becomes 

g = grad p. (1) 

In hydromechanics it is customary to refer the body forces to unit 

* Since the normal, stresses in the theory of elasticity are reckoned positive -when 
in the direction of the outward drawn normal to a dosed surface, we must give the 
inward pressure p the negative sign. 
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mass rather than to unit volume. If p is the density of the fluid, and 
G is the body force acting upon one gramme of fluid, then the body 
force on unit volume is pG, and the fundamental equation of hydro- 
statics becomes 

G — - grad p (la) 

P 

Just as it was necessary to introduce a relationship between the 
stress and strain tensors in elastomechanics, so a connexion between 
density and pressure must be supplied in hydrostatics. In general, 
the compressibility of liquids may be neglected, and so p may be con- 
sidered constant; for gases, the required connexion is given by Boyle’s 
Law (see below). 

Of the many special problems of hydrostatics and aerostatics we 
select two — the pressure gradient in the atmosphere and the form of 
the surface of a rotating liquid. For the first problem, we take a 
z-axis vertically upward, and the x- and y-axes in the horizontal plane, 
which is the surface of the earth, since we limit ourselves to a small 
portion of the surface. If the density at a height 2 is p(z), then the 
force acting upon unit mass of air is — gk. Since x and y do not enter, 
grad p reduces to (dp/dz)k, and we have the equation 



Now for gases at constant temperature, the density is proportional to 
the pressure, by Boyle’s Law; that is, 



Using this relationship, equation (2) becomes 



Integrating between 2=0 (at the earth’s surface) and z — 2 , 



or p — p e e-(pJpJ (4) 

This formula hi due to Halley. It is to be remembered that Halley’s 
formula is true only provided the temperature is constant at all levels. 
Nevertheless, it furnishes a fairly good representation of observed 
pressures at various altitudes. 

The integration of the hydrostatic equation is particularly simple 
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if the density may be considered constant and if the external forces 
are derivable from a potential U. In this case 

G = —grad U, . (5) 

and (la) becomes 

grad V + - grad p = 0 

If p is constant, this equation may be integrated at once, giving 

U — const. . ..... (6) 

This equation states that under the assumptions made, the surfaces 
of equal pressure are equipocential surfaces of the acting forces. Now 
the free surface of a liquid is always a surface of constant pressure, 
the pressure being that of the atmosphere. Hence the free surface is an 
cquivotential surface. For a fluid at rest and subject only to gravity, 
this surface of equal potential is a sphere with centre at the earth’s 
centre and radius equal to the distance of the surface from this point. 

We next investigate the form of the surface of a liquid contained in 
a vertical cylindrical vessel and rotating, along with the container, 
about the axis of the cylinder with angular speed at. This is only 
apparently a dynamical phenomenon. After a brief period of adjust- 
ment there results a state where the liquid rotates as a whole, along 
with the vessel, and relative movement of the particles no longer occurs. 
Thus an observer rotating with the system notes a state of equili- 
brium. Apart from gravity, the only force acting upon the individual 
volume elements is the centrifugal force, which we may treat as an 
externally applied force also. Taking the positive *-axis upward along 
the axis oi the cylinder, the potential of gravity becomes gz. The 
centrifugal force is radial, and so is normal to the cylinders 

V -f- y a = r --- const. 

The i> too vector is directed outward and has the magnitude to\ and 

hones A the negative oi the gradient of a function 

Titer., according to equation (6). the surfaces of equal pressure, in- 
cluding the tea surface, are given by 

z .... ~ const,. ...... (7) 

2g 

i.e, they are paraboloids of rotation whoso meridian section has the 


- O') 
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If the origin of co-ordinates is placed at the lowest point (vertex) of 
this carve, the equation of the surface is 

z = ~ *» 

' 2 9 

The vertex of the parabola is, of course, below the plane of the non- 
rotating liquid, while the liquid at the walls is higher than this plane. 

It may be mentioned that tbe phenomenon just discussed has been used 
to produce a concave parabolic mirror for optical purposes by rotating a vessel 
of mercury. The method, however, was not very successful. 

Ex. 59. Using the hydrostatic equation®, derive Archimedes' Principle, 
which states that a body Immersed in a fluid is buoyed up by a force equal to the 
weight of the displaced fluid. (Remember that the upward force is the resultant 
of the pressure forces.) 

2. The Fundamental Hydzodynamicri Equations. 

If a fluid is not in a state of rest (equilibrium), we say that a con- 
dition of flew prevails. This state of flow is completely defined only 
if the flow velocity is known at every point at each instant. In prac- 
tice, however, we deal almost wholly with problems of stationary 
states, i.e. with cases in which the flow velocity vector at a given 
point does not change with time. If, for example, we inquire about 
tbe lift experienced by an aeroplane flying with uniform velocity, we 
may consider the aircraft stationary and the air blowing past it from 
infinity with uniform velocity. This is actually the arrangement used 
in wind tunnel experiments. The distribution of flow about the aero- 
plane is then steady. In the non- stationary state, two kinds of curve 
are to be distinguished— the lines of motion, or paths described by 
single particles of the fluid in the course of time, and the stream lines, 
which are defined as follows: At a given instant let the flow velocity 
vectors at each point be drawn in the form of small arrows. These 
arrows may be put together to form curves whose tangents at each 
point have the same direction as the arrows. In the non-stationary 
state, the picture of the stream lines changes from instant to instant. 
Consequently, a particle which moves from P to P' in a time dt has 
a velocity at P' different from that given by the stream-line picture 
corresponding to the. time when the particle was still at P. In this 
way we see that the, lines of motion and, the stream lines coincide only in 
the stationary state. 

In hydromechanics, just as in elastomechanics, we perform the 
transition from statics to kinetics by introducing the inertial force 
for unit volume, 

d?r 
p ~<ity 
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by means of d’Alembert’s principle, thus referring the kinetic prob- 
lem back to statics. There will be equilibrium between this force, the 
body forces and. the pressure forces when, according to equation (la) 
(p.192), 

pQ ~ p ^ grad ^> 

or ^=0~^gradp. (8) 

Id anticipation of the most important special problem of stationary 
flow we resolve the total rate of increase of velocity per unit time, 
i.c. the vector d?rjJt 2 — dv/dt, into a purely temporal part and a 
purely spatial part. The latter arises from the fact that, in the time 
dt, the volume element moves to a place r + dr, where the velocity is 
v + dv ~ v -f- dr grad v. Reduced to unit time, this part of the 
acceleration is 



where d r signifies the spatial change. 

For non- stationary motion, on the other hand, the flow con- 
figuration changes during the time dt, so that the particle at r -f dr 
is in & state corresponding to the time increment at. Neglecting terms 
of higher order, we may pat for this purely temporal change the value 
obtaining at the point r. Let this acceleration be denoted by dv,'dt. 
The total acceleration is equal to the sum of the purely temporal 
and purely spatial rates of change of velocity, and so equation (8) 
becomes 

~7 -f- zrgrad grad p. . . . (10) 

at p 

Besides this fundamental bydrodyaamical equation we have another, 
called the equation of continuity. The mass of fluid passing outward 
through the surface of an arbitrary volume element, per second, is 

pvdS. 

The scaler product V d8 automatically provides that inward flow be 
reckoned negative. The quantity flowing out must be equal to the 
decrease, per second, in the amount of fluid within the volume element, 
provided we rule out the possibility of the existence of sources or sinks 
within the volume. That is, 

<f pvd8=-~fpdr. 


- ( 11 ) 
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If we convert the surface integral into a volume integral, using Gauss's 
theorem, then, since the equation must hold for every dement of 
volume, we have 


div (pv) — — 


(12) 


For incompressible fluids this equation of continuity simplifies to 
div o = 0. (12') 


la view of the meaning of v grad • t> given on p. 32, the com- 
ponent form of the general hy drody namical equations may now be 

written 




dv v , dv v , dv v dv v 

W + v *Ji + v '3j ' 5 87 ~ 

3t> ? 1 . , ' J " . ^ V ‘ _ 

dt ' J<c dx ~ v 3 y ' Jl dz 



and the equation of continuity becomes 

3jpty 8 (pv v ) d (otq) _ __ dp 

dx dy ' dz of 


iiOa) 


(Via) 


The co-ordinate form of the fundamental siydtodyuamica! equa- 
tions indicates the special mathenuftica) difficulties arising in hydro- 
dynamics. Since products of velocity components with their deri- 
vatives occur, the equations, as compared with those which have- pre- 
viously occurred, arc not linear, and the rale for writing the general 
solution as the sum of particular solutions is no longer valid. This 
rule, which has wide applications to linear partial differential equations 
occurring in physics, e.g. the electrical field equations, has been used 
above, in the treatment of the vibrating string (p. 186). 

We limit ourselves, in what now follows, to the case of stationary 
flow, and so put all pure time derivatives equal to zero. We again 
transform the resulting equations for the stationary star,; and obtain 
a. further classification of the flow patterns into rotational and trro- 
iational flow. From the vector formula 

[A curi £ j = [A [VB] ] -= V . A, B — AV • B = grad A „ B JL grad . B 


derived on p, 38, we have for B — A; 

[A ouri A] --- grad A C A — A grad . A == | grad a* — A grad . A- 



THE MECHANICS OF LIQUIDS AND GASES 


19'/ 


IX.] 


If we apply this formula to the expression v grad . v, we have, for 
stationary flo^y, 

| grad v 2 — fv curl v] — Q — - grad*. . . (13) 

p 

The integration of the equations becomes relatively simple if we are 
deali og with a type of flow for which curl v vanishes in the entire space. 
Thus, if curl v is zero everywhere, then v itself may be represented 
as the gradient of a scalar quantity <&, the velocity potential, and the 
problem is solved whe?r the single scalar quantity 6 has been found 
as a function of the co-ordinates. 

Ex- SO. Using equations (10a), calculate the equilibrium rate of flow of a liquid 
from si hois in the bottom of a vessel filled to a depth h. 

Ex. 61. In the same way, calculate the velocity of escape of a gas confined 
under a pressure p t — p 0 in excess of outside pressure; Assume that Boyle’s 
Lav/, jp/p =* const., holds hare, and neglect the gravity force acting upon the 


3. Irrotationai Flow. 

(a) BermtMi's Equation 

If the body force per gramme is derivable from a potential, which is 
almost always the case, we have Gf = — grad U. In this instance, it 
we integrate equation (13) along a stream line, ds and v have the 
same direction, and so [u curl xj] da vanishes. Then 

..p f, ~f 

j |(grad v 2 )ds -- j f curl a) ds 

Jp, Jp, 

~ f t-~m£U)ds - j ft g radp)d« . (14) 
becomes, for incompressible fluids, 


■s + P + — “ ir 4- U 0 4’ ^ const. . . (15) 

1 p Jj p 

The values of the constants are in general different for each stream 
line. However, if the Sow is irrotationai — i.e. if curl v vanishes 
throughout the entire liquid — then, by equation (.14), the integration 
may be performed along any arbitrary curve. In this case, the con- 
stants must be the same for all st&am lines. Reciprocally, we may 
conclude that if the flow originates in an irrotationai * region, where 



being parallel. 
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has the same value for all stream lines, then the flow remains irro 
tational in the entire region. This is true, for example, for the outflow 
of liquid from an orifice in a vessel, since all three terms of the ex- 
pression , 2 


have the same values all over the free surface. 

(6) Calculation of irrotational (or •potential) flow 

The Bernoulli equation (16) shows that irrotational states of flow 
are not only mathematical simplifications, but also that they must 
occur very frequently in nature. If, for example, air is blown past an 
aeroplane model in wind-tunnel experiments to determine the lift, 
then it is true that the air entering the canal has a given velocity and 
a given pressure, and that the gravitational potential — which, more- 
over, is negligible here — has practically the same value for all stream 
lines. Consequently, according to the above laws, that flow pattern 
which corresponds to the solution based upon the assumption of 
absence, of rotation is the one which will form about the model. 

This pure potential flow, however, leads to paradoxical results. 
For example, imagine an infinitely long circular cylinder immersed in 
a uniform, parallel flow which is normal to the axis of the cylinder. 
According to the above principles, the fluid will exert no force on the 
cylinder. The discrepancy is to be attributed to the fact that even 
for fluids of low viscosity the friction at the Burface of the body is not 
negligible. According to experience, the. liquid clings to the body in 
the immediate neighbourhood of the surface, while pure potential 
flow requires smooth, unhindered motion. Due to the clinging of the 
liquid there is a large velocity gradient in the boundary layer and curl v 
no longer vanishes there. Although the viscosity is small, it causes 
the boundary layer— which in the limit is to be thought of as a sur- 
face of discontinuity (see p. 40)-~to roll up and detach itself, producing 
individual vortices. Such surfaces of discontinuity also can form 
where two streams of different velocity meet, e.g. behind the trailing 
edge of an airfoil. The fact that the methods of potential flow are still 
usable, especially in the two-dimensional case, is due to the possibility 
of representing vortices mathematically and correcting in this way 
the potential flow. Such cases, however, are no longer irrotational in 
the strict sense of (a).* 

Let us now consider the mathematical treatment of irrotational 
flow. We put 

V — grad $ (16) 

* la many bookB on hydrodynamics the term “ turbulent ” is reserved for vertex 
motion in the narrower sense, while parallel flow having a velooity gradient normal to 
the flow, so that curl v 4= 0, is referred to as “ non-irrotational ” 
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The equation of continuity for incompressible fluids then becomes 


div v — div grad <t> = AO =0, ... (17) 

and the actual hydrodynamical equations become 

^gradt 2 — G — ^ gradp .... (18) 

for steady flow. Thus every function O which satisfies the differential 
equation AO = 0 represents the velocity potential of an irrotational 
fluid motion. The problem is then to find the solution which corre- 
sponds to the physical conditions of the special ease under considera- 
tion. With this solution at hand, the velocity field may be obtained 
by taking the gradient, and the actual hydrodynamical equations (18) 
are necessary only for the calculation of the pressure distribution. 
As the simplest example of potential flow in space, we investigate the 
spherically symmetric solution of the equation A<h = 0. Since 
depends on r alone, the equation has the familiar form (cf. p. 63) 


As is at once evident, a solution is 


By taking the gradient we obtain 


This means that the flow is directed radially outward from (or inward 
toward) the point 0. In the former instance, a source exists at 0 — in 
tha second case, a sink. The constant a is determined by the volume 
of fluid Q flowing through a sphere about 0 each second. This amounts 
to 


Q = 477T 2 ^ — 4770 (22) 

Thus a is determined if Q is given. The additive constant 6 does not 
affect the flow, since it drops out when the gradient is taken. Usually, 
however, the potential is so normalized that it vanishes at infinity, 
in which case 6=0. 

A problem which is rather more troublesome to solve i3 that of 
the flow pattern resulting when a spherical obstaole is placed in a 
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parallel stream whose potential function, is <1> -- v^x. The special 
conditions in this case are: At a great distance from the sphere, the 
fluid motion must bo the undisturbed parallel flow; and at the surface 
oif the sphere, the normal components of the flow must vanish. The 
fluid, considered to be ideal and non- viscous, may move freely along 
the surface of the sphere. In this connexion, the 'newer hydrodynamics, 
which corresponds better with observation, differs from the classical 
theory. The more recent theory starts with the assumption of a cling- 
ing boundary layer, as indicated above (p. 19S ). 

We shall not reproduce the solution of this problem of the sphere 
here, since the essential point may be seen from the two-dimensional 
problem of the flow past an infinitely long cylinder, vis:, that the 
resultant force on the obstacle vanishes, which is, or course, com- 
pletely at variance with experience. 

If the flow is taken to be two-dimensional, i.e .if the flow pattern 
is the same in all planes normal to the z-axis, then a large number of 
solutions of the potential equation can be obtained with the aid of 
conformal representation (p. 89). The connexion between the two- 
dimensional potential problem and conformal mapping of the com- 
plex plane is as follows. 

Introducing the potential function, we have 


3# 30 

v a - w ~, g-, 




and div v ■=• A# — 0 (equation of continuity). . (17) 


The condition for irrot&tionai motion is 

dv v dvz __ q 

dx dy ~ 


( 23 ) 


the co-ordinate z being absent. This condition is automatically fulfilled 
by our assumption. If, now, we take another function T and put 


3T = _ 3T 
3 y’ 9 3 s’ 


m 


then the equation of continuity is identically satisfied. In this case, 
the condition that there be no rotation requires of T that it satisfy she 
equation AT =» 0. If, then, in the two-dimensional case, we have any 
function Q which satisfies the potential equation, we can obtain a 
system of stream lines from it in two ways — first, by taking the gradient; 
secondly, by means of the correspondence (24). If, reciprocally, a given 
flow is specified on one hand by the potential function <J>, and on the 
other hand by the stream function T according to equation (24), then 
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since the components «„ and i\ obtained in these two ways must be 
equal, we have 

dx 

0® 3]F f' **'*'* 

dy dx ' 

But these are precisely the Cauchy Kiemann differentia! equations 
(see p. 71), satisfied by the real part <I> and the imaginary part *F of a 
function of the complex variable z — x -f- ty. Since, in addition, <5 
and V F satisfy Laplace’s equation, we may say that a Sow pattern is 
given by every complex function 

Lt ~ © + iW ~ f(x + iy); .... (26) 

the curves $ — const, are the equipotential lines, and their orthogonal 
trajectories are the stream lines. But according to the theory of con- 
formal mapping, the curves O — const, are orthogonal to the curves 
qp as const., so that the curves 'F -- const, are really the stream lines. 
By interchanging the functions <S> and *F it follows that &. second flow 
pattern is given by the conformal representation if we take the curves 
*F = const, to be the equipotentiais and the curves O — const, to be the 
stream lines. If we take the derivative* 



we sse that the complex number w agrees in magnitude with the flow 
velocity, and is the reflection of the vector v in the a* -axis Hence w 
is called the conjugate- complex velocity, Q is called the complex paten- 

tiaL 

We now take a particular form for /(«), which, as will be seen at 
once, corresponds to cylindrical symmetry, viz. 

f(z) «I> -f FF = a iog z or e^^cos— sin— j. (28) 

Separating reals and imaginaries, we find 




(29) 
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By eliminating *F or O, these equations give 

x 2 y 2 — e 2 * ta or $ = a logr, . . . (30) 

and - — tan—. (31) 


The equipotential curves <£> const, are circles about 0; the 
stream lines are therefore straight lines radiating from 0 (fig. 1, a). 
This flow thus represents the two-dimensional analogue of the spheri- 
cally symmetric flow from a source, considered on p. 199 . The con- 
stant a is determined by the amount of fluid passing through unit 
length of a circular cylinder whose axis contains the source. 

On the other hand, if we interpret the corves T = const, as equi- 
potentials, then the circles are stream lines (fig. 1, b). This flow pattern 
is the prototype of a circulation. If the point 0 is isolated from the 



field of flow by circumscribing a small circle, then curl v is zero in all 
the remaining space. On the other hand, the line integral vds taken 
along one of the circles (or more generally, along a curve enclosing 0), 
no longer vanishes, for ds always has the direction of v along the 
circles, and all contributions to the integral, are positive. We shall 
return to this type of flow in § 5 below. 

We now consider the two-dimensional analogue of the problem 
of the sphere referred to above (p. 199), viz. the case of a circular 
cylinder immersed in a parallel stream. We take a co-ordinate system 
such that the undisturbed flow is parallel to the a:- axis, and look for 
a function £2 — f(z) which allows the entire £2-pIane to be mapped 
on that portion of the z-plane which lies outside a circle of radius R, 
It is further required that the normal components of the velocity 
vanish at the surface of the cylinder — i.e. that the circle itself be a 
stream line, for which V F = const. — and finally, that the complex 
velocity w — diljM shall pass over into the real quantity » = j v\ for 
large values of z. As will be shown immediately, the function 


, ( 32 ) 
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fulfils these conditions. This is equivalent to 

• + ^-“(‘ + 5rf ? ) + **(V- ? f?> (S2 ' ! 

By solving the equation for z, it will be seen that for every value of 
Q there is one, and only one value of z whose modulus is greater than 
22; hence there is a unique correspondence between the points of the 
Q-plane and the portion of the z-piane outside the circle. Moreover, 
we see that the curve T = 0 gives the circle a; 2 + y 2 = ii 2 and the 
straight line y — 0, i.e. the circle coincides with a stream hue. Finally, 
the value of Cl becomes vz for z large, and so the complex velocity 
w = dCl/dz becomes equal to the real quantity v. 

Because of the complete symmetry of 
the flow pattern (fig. 2) in front of and 
behind the obstacle, we see that the mag- 
nitude of the complex velocity w is the 
same at any two points on the circle which 
are symmetric with respect to the y-axis. 
From Bernoulli’s equation, it follows that the 
pressures are the same at two such points, 
so that the resultant of all the pressure forces 
vanishes. The concept of a simple potential 
flow thus leads to a paradoxical result in 
this case. The reasons for this have been given above (p. 198). 



4. General Theorems concerning Vortex Motion and Circulation, 

(a) Distinction between vorticity and circulation 
We call the line integral £ v ds, taken along a closed curve G, the 
circulation K along the curve. In view of Stokes’s theorem, 

<£ vds — J(cm\ v)d%, 

we might at first sight expect that in a flow where curl v is zero every- 
where the circulation along every curve would have to vanish. But, 
as has already been seen in the example of circular flow about a point 
0, where 0 itself is excluded (fig. 1, b above), the circulation may, in 
certain cases, differ from zero even when the flow is iixotational. This 
contradiction arises because the region is no longer simply connected 
after we cut out the point 0, i.e. there now exist closed curves which 
cannot be closed in to a point without passing out of the region. These 
are all curves which enclose the small circle drawn about 0. In the 
proof given of Stokes’s theorem (p. 27), the simple connectivity of the 
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■D-space was tacitly assumed. Hence, in the two-dimensional case, if 
we make the region multiply connected by excluding a portion o£ the 
plane, the circulation can be different from zero, despite the fact that 
the flow is irrotational in the region under consideration. Stokes’s 
theorem may be retained, however, if the excluded region is replaced 
by suitably disposed vortices, i.e. if we take a certain distribution of 
curlzt on the boundary of the region. This procedure is especially 
important in the three-dimensional case. 

(5) Thomson** Theorem on the conservation of circulation 
Consider the particles of the fluid which form a ok-sed curve C 0 at 
time t — 0. Since wc rule out discontinuities in the body at the fluid, 
particles originally adjacent will remain so as time goes ou, i.e. these 
particles wiU continue to form a closed curve which, however, may 
undergo changes in shape. The location of a particle on a curve 0 
may be given by a single parameter A, say the length of arc. We now 
write the circulation along G 0 : 

K— & vds — v (38) 


Differentiate with respect to the time. Since A has nothing to do with 
time, the order of differentiation with respect to time and with respect 
to A may be reversed, and we have* . 


<lE 

~¥ 


*5 S«+sE- 


d> \ dt ) ' 


We now assume that the external forces are conservative, i.e. 


- grad F. 


Then, since (grad Fj ~ , 

' (tX d\ 

f 

Inasmuch as the integrand functions F and are definite functions 


* Although two variables t and A occur boro, total derivatives are written, since 
the partial derivative., 3/3i! were used in another sense in § 3 (p. 195) above. Here 
we are concerned with the total change of v in time, not merely the change occurring 

when the position is held fixed. 
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of position, the integrals taken along a closed curve 0 0 vanish, and 
we have 

~ — 0 or K — const. .... (35) 

dt 

The circulation is constant for a closed curve corn-posed, of a given set of 
fluid par tides if the external forces are conservative . This result was 
given by Sir W. Thomson (Lord Kelvin). 


(c) Helmholtz’s Vortex Theorems 

A number ox important theorems, found by Helmholtz in another 
way, may be derived from the Thomson theorem. Imagine the field 
lines of the vector curl v drawn, i.e. the curves whose tangents at any 
point of space give the direction of curl v, oi the axis of rotation u 
of the fluid particles. In order to obtain a measure oi the strength as 
well, we assume that a number of field lines proportional to cwl© 
pass through every square centimetre of a surface normal to curl©. 
These field lines will now be called vortex lines. 

From the general theorem of vector analysis 

div curiv — 0, 

we conclude that the vortex lines &m neither begin 'Am terminate within 
the fluid’, thus they an-, either closed curves o r , in the- case of a bounded 
fluid, f hey beg ' a ana end on the surface. 

A tube-like surface made up of vortex lines called a vortex tube. 
II the cross-section of tills structure is so small that curl© may be 
considered constant over it, we speak of a vortex jikment. For such 
vortex filaments we have the theorem that the product of the cross- 
section by the magnitude of carl© — the so-called vertex strength— 
is constant along the filament; for if we take the surface integral Id (a) 
over a portion oi a vortex filament bounded by two cross-sections 
Aj and „4 a normal to curl©, the contribution of the lateral surface 
drops out, since dS is normal to curl v everywhere on this surface, 
and there remains only the contribution of the two ends A t and 4 S . 
Since vo vortex Lines can originate on the inside, and since curl© 
has the same direction as dS at one end, and. ‘Ac opposite direction 
«>t the other end,, 

J curl© j x A t =-- | curie j* , . . (36) 

Further, we can show that e' cry ' ortex t I ■ *>e it always consists 
of the same fluid particles, i.e. the filament travels along with the 
fluid. .Draw a closed curve G (fig- 3) which docs not encircle the vortex 
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tube. By Stokes’s theorem, the circulation £ a v ds along this curve is 
zero, since curl v remains normal to IS 
at all points, of the surface enclosed by 
0. According to Thomson’s theorem, 
the circulation along a curve composed 
of the particles which constitute the curve 
C at time t must remain zero. This 
means that the curve 0 remains on the 
surface of the vortex tube. Since these considerations are valid for 
every curve on the lateral surface which does not encircle the tube, 
we may imagine the entire surface made up of elements bounded by 
such curves, the circulation being zero along their boundaries! This 
means, however, that the vortex tube consists always of the same fluid 
particles. 

This law holds for vortex filaments also, since a filament may be 
thought of as the common element of two vortex tubes. The motion 
of vortex filaments can be observed, for example, with smoke rings 
produced by blowing tobacco smoke from the mouth. The smoke 
particles serve to make the motion of the air visible. The vortex ring 
moves onward with the air expelled from the mouth. ‘Rotation of the 
air around the axis of the ring may be observed only in the first stages 
of the motion, since it is soon destroyed by the viscosity of the air, 
which has been left out of account here. 

We now consider a curve R which encircles the vortex filament. 
The circulation along this curve is equal to the vortex strength of the 
filament, by Stokes’s theorem. Now by Thomson’s theorem, the cir- 
culation round this fluid curve is constant in time, so that the vortex 
strength must be constant also. We thus obtain the theorem: In « 
frictionless fluid, the strengths of the vortex filaments are constant in time,, 
and hence vortices can be neither created nor destroyed. 

The remarkable properties of circular vortex rings — their indestructibility, 
indivisibility, &c. — led Sir W. Thomson to formulate the ingenious theory 
that atoms are vortices in the ether. This idea was developed by Sir J. J. Thomson 
in an Adams Prize Essay (1883). The matter is only of historical interest now. 


(d) The law of Biot and Savartfor the flow generated by a 
vortex filament 

If the vector curlz) is given as a function of position, the field of 
flow may be calculated from it. We take the simple case of a single 
vortex filament of strength K. This problem corresponds exactly 
co the calculation of the electromagnetic field of a thin wire carrying 
a current I, since for a curve of area A encircling the vortex filament, 
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£ vds — J{cmlv) dS — | curlv J A — K s , . (37) 

while in the magnetic case, for a curve which encircles the conductor, 
we have the analogous relationship (cf. p, 301) 

ji Kd$ = J (curl H) <2S = | curl H j A ~ . (37') 

But the solution of the magnetic problem ia well known; we have the 
law of Biot and Savart: 

a-A/ESpl, ,38') 


where ds is an element of length of the conductor, and r 0 is a unit 
vector in the direction of r. Hence for the present hydrodynamical 
example. 


-fS d ±p>\ 


(38) 


where dt, is an element of length of the vortex filament; 


5. Plane Circulatory Motion. 

(a) Circulation accompanying the. Jiow about an obstach 


Plane potential flow supplemented by the inclusion of circulation 
is of considerable practical importance. While pure potential flow 
leads to paradoxical results, as we saw in the case of the cylinder 
immersed in a uniform stream, it is possible to modify the assumptions 
and thus attain satisfactory agreement with experimental results. 
This modification consists in introducing a term which givfes a finite 
circulation, but which leaves the actual region of flow irrotational. 
The addition of this circulation allows for the vortex formation re- 
sulting from fluid friction in the boundary layer. From the point of 
view of strict accuracy, it is true, it must be borne in mind that two- 
dimensional problems do not occur in nature, since actual obstacles 
are of finite length, and that the disturbances originating at the ends 
are not included in these calculations. As in § 3 (p. 201), we utilize 
the complex plane for the representation of the plane motion. Let 
the stream lines be obtained by means of the complex function, accord- 
ing to the method of § 3. The derivative dCl/dz gives (p. 2ol) the com- 
plex velocity w, or v„ — iv v , with the help of which the calculation of 
the circulation about an obstacle becomes particularly simple. Thus 
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if we integrate along the profile of the body, we have, since the obstacle 
is bounded by stream lines, 

(39) 

dx vf 

or v x dy — v v dx = 0 (39') 

Consequently [of, equation (27) (p. 201)], 

K cj)vds— <j) [vjlx -f %% -j- i(y x dy ~ v y dz'<] ~ (j) wdz. (40) 

The evaluation of the complex integral becomes particularly simple 
if we use Cauchy’s Integral theorem. Since the disturbance vanishes 
at a great distance from the body, i.e. the Sow passes over into the 
undisturbed uniform flow, the complex velocity must be represeutable 
by a series in powers of L/z, the point i — 0 being within the body; 

V) ~ v x -f -Cl 4 . 4 - , , (41) 

According to Cauchy’s theorem ( 33 . 72), we then have 

E ~ <4 v?dst p- 2jrt»_ v , . , . . (42) 


{ b ) The Kutta-Jouleawski ’. ■ft formula 

We can also calculate very simply, by means of complex functions, 
the resultant pressure exerted by a stream upon an obstacle. The 
force acting on unit length of & cylindri- 
cal obstacle is given by 

V-.-- fpnds, , „ . ( 48 ) 

where n is the direction of the normal 
directed toward the interior of the 
body (fig. 4). Since the normal com- 
ponent of vho velocity vanishes »v the 

surface of the obstacle, the integral is / | a;r_ 

taken along a stream line, and wo. can 
apply Borneo ill’s formula 

f | £ = A 

i.e. (f pnds— - ~ rfj t- .'6 hndn. 



. -(44) 
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The second integral on the right vanishes, however, for a plane curve, 
as may be shown readily. Thus, if we call the unit tangent vector t, 
and the unit vector normal to the plane of the curve b, then n = \bt\ j, 
whence 

j) lends = k <f> [(bt)]ds = Ic^b £ tfdsj = h^b £ efcj = 0, (45) 

since <ji ds vanishes for every closed curve. In order to evaluate the 
first integral, it is convenient to introduce a complex number 

Z=F v + iF x , (46) 

whose radius vector in the complex plane is thus a reflection of the 
force vector in the line biseeting the first quadrant. Now, by equations 
(43) and (44), 

F v — — | £ v 2 cos (ny)ds, F a ~ — | j) v* cos ( nx)ds . (47) 

But from fig. 4 we see that 

ds cos (ny) — dx, ds cos (nx) == — dy. . . (48) 

We thus obtain 

Z — — ■* (f v\dx — idy) 

= -%(f.W + v v *)(dz~idy). • ■ (49) 

If we add under the integral sign the expression 
2i {v jty — - v,jdx) (v a — iv v ), 
which is aero, by equation (39') (p. 208), we obtain 

Z — — ~ j) (v x s — 2iv x v t — v 2 ) (dx -(- idy) — — ? ^ w 2 dz. (50) 

Since the disturbance caused by the obstacle vanishes at a great dis- 
tance from it, the complex velocity w may be represented, according 
to equation (41), by a series of descending powers of z, if we take the 
origin within the obstacle: 



The corresponding complex potential is, by integration, 

II = o 0 + w x z + «-i tog 2 — — + (4T) 
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By equation (42), the coefficient is connected with the circulation 
by the equation 


If we form the integral <j)W 2 dz, we are concerned only with the term in 
1 jz. In squaring tv, this term obtains the coefficient 2^ a _ j and the 
value of the integral becomes 


Z = —pKw m — —pK(v Xoa — iv Boo ). . . . (52) 

From the definition of Z, in (46), we thus have 
F x = I(z) =+pKv Vm \ 

F v = R(z) = - P Kv Xm ) ( / 

These equations constitute the Butta-Joukowski Life Formula, 
so rtamed after its discoverers. It shows at once that the simple poten- 
tial flow about a circular cylinder causes no resultant force on the 
cylinder, since the complex velocity 

vB? 


which, by equation (32) (p. .202) occurs in that case, contributes no 
term in 1/z to the integral w 2 dz. In reality, however, the breaking 
away of the boundary layer from the cylinder causes a vortex region 
to form behind the body, and this invalidates our calculations. The 
turbulent detachment of the boundary layer can be checked by setting 
the cylinder into rotation. In this case we have, superimposed upon 
the solution (32) (p. 202), another flow which streams around the 
cylinder in circular paths. The corresponding complex potential has 
already been encountered in § 3 (p. 202), and is A log z. The complete 
flow is then given by the potential function 



y) + A lo g*> • ■ 

■ • . m 

and the complex velocity 

vR 2 A 

, . . (55) 

w = v - 

z* + z 

The circulation is 

iw “i S3- • ' 


K = 2mA, 

. . . (56) 
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If we assume complete adhesion of the boundary layer, the circulation 
has the value % twR 2 , where o> is the angular velocity of rotation of 
the cylinder and R its radius. If the original 
flow is along the jc-axis, we have a force in 
the direction of the y-axis, 

F v = -2-TTpwR^v (57) 

The force is directed toward the side on 
which the stream and peripheral velocities 
have the same direction, since K is positive 
when the surface is on the left.* This pheno- 
menon is known as the Magnus Effect, after 
its discoverer. This somewhat unexpected 
transverse force has recently had an interest- 
ing practical application in the rotor ship. 

By means of relatively simple trans- 
formation functions the flow pattern of 
a combined parallel flow and circulation about a circular cylinder 
may be converted into the flow about a profile similar to the 
wing sections used in aeronautics. According to the magnitude of 
the circulation about the cylinder, we obtain flow patterns of types 
a or c (fig. 5). Case b represents the transition. In case a, where the 
circulation is small, the flow would have to pass around the trailing 
edge to the upper surface; in case c, the flow would be required to 
leave from the lower surface. Since it is an empirical fact that a fluid 
does not pass around sharp comers in this way, the circulation which 
actually occurs must be that corresponding to case b, where the fluid 
leaves the profile smoothly at the trailing edge. Thus the lift of a given 
wing section may be calculated in advance if the section is obtainable 
from the circle by conformal transformation. It is to be remembered, 
however, that the two-dimensional computation is a somewhat rough 
approximation to the actual case. 

It may be of interest to .sea how the creation of circulation about the profile 
is to be reconciled with Thomson’s theorem concerning the constancy of the 
circulation. If we imagine the wing section brought into an ordinary parallel 
stream for which no circulation exists for any curve, then no circulation can 
come about, according to Thomson’s theorem. But, as already emphasized; the 
friction at the boundary layer oannot be neglected, even for a fluid of low vis- 
oosity. As a consequence, Thomson's theorem may be applied only to a fluid 
curve enclosing the body, and at a great distance from it. The circulation remains 
zero for suoh a curve. To the same degree as a circulation develops about the 
profile, an oppositely directed vortex — the so-called starting vortex f —develops 



* The value in (57) holds only in the limiting case where no detachment of the 
boundary layer occurs. Actually, however, the value of the rotational velocity of 
the fluid to be substituted in the equation is only about m/2, since a partial detachment 
of the boundary layer always takes plaoe. f Ger. Anfahrwirbel. 



212 


MECHANICS 


[Chap. 


at the trailing edge. When its strength becomes equal to the final value of the 
circulation about the profile, this vortex is shed downstream, and the flow pattern 
remains steady thereafter. On account of the presence of both this vortex and 
the circulation about the profile, the circulation remains zero about a curve 
which encloses the profile and is at a great distance from it, since it encloses 
both the vortex and the circulation about the profile. 

6. Undulatoey Propagation of Disturbances in Fluids. (Sound Waves.) 

Compressions and expansions which are propagated in liquids and 
gases in the form of waves are among the non-stationary kinetic 
phenomena lending themselves most readily to mathematical treat- 
ment. The reason is that the velocities and density changes which 
occur are so small that products of these quantities may be neglected. 
For this case, then, we may strike out the term v grad , v from the 
equations of motion (p. 195). In the absence of external forces these 


equations then become 

^=_lgradp . (58) 

Let the density be represented by 

p= p[l+o(t,x,y,z)] (59) 


where p is the average density and a is a measure of the departures 
from this normal value. Putting thif| expression into the equation of 
continuity (12) (p. 196), and neglecting terms of the second order, we 
obtain after division by p: 

|j + divt>= 0. ...... (60) 

Since (58) is equivalent to three scalar equations, we have four equa- 
tions for the five variables v x , v v , v t , p, or: thus we need one more 
equation. This comes from the relation between p and p, or between 
p and a. For gases, Boyle’s Law 

P 

V P 

(p — - average pressure) suggests itself. However, this leads to erroneous 
values for the velocity of sound, the reason being that Boyle’s Law 
holds for constant temperature, while the compressions and expan- 
sions follow one another so rapidly here that the heat generated ac- 
cording to the laws of thermodynamics (cf. p. 506) cannot, be dissi- 
pated sufficiently rapidly. We must apply the adiabatic law (p. '506), 
which holds for complete thermal insulation. In terms of the density, 
which is inversely proportional to the volume, this law is 
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| = (jy - (1 + a)y « % + ya, ... (61) 


where y is the ratio of the specific heat at constant pressure to that 


at constant volume. Then 

grad p = py grad o, (62) 

and, again neglecting products of small quantities, 

- gradp = ^ grad ir (63) 

P P 

If we substitute this value in the simplified hydrodynamical equation 
(58), we obtain 

• (64) 


It is easy to eliminate v from equations (60) and (64). Since deri- 
vatives with respect to position and time may be taken in either 
order, we need only differentiate equation (60) ouce more with respect 
to t, take the divergence of equation (64), and equate the two 
resulting expressions for 


The result is 


div grad a — Aa — 


JL 3 !f. 

py 3f 2 ‘ 


(65) 


But according to p. 63, this is the differential equation of a wave 
motion propagated with the wave velocity 

« = ^ (66) 

Thus we have the formula for the velocity of sound in gases. The 
average density p is still a function of the temperature. By p. 494, 
we have 

(67) 

where p 0 is the density at 0° C., 6 is the centigrade temperature, and 
a is the coefficient of expansion of the gas. Hence 

,-M IH (68 ) 

V Po 

Since p 0 is proportional to p, v is independent of the pressure. 

Ex. 62. Calculate the velocity of sound in water, the value of the compres- 
sibility being x = 60 x 10~ a atm. -1 and y == 1. 
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7. Hydrodynamics of Viscous Fluids. 

(a) Simple Unear laminar flow; Poiseuille's Law 
TIis term ‘‘laminar” is applied to a flow pattern in which the 
fluid may be divided into parallel layers which flow past one another 
with various velocities. The simplest case is that where the velocities 
of all layers have the same direction, say that of the awards. , As a 
result of. internal friction (viscosity), the more rapidly moving layer 
tends to drag the neighbouring layer along with it, and thus accelerates 
it Reciprocally, the slower layer tends to retard the faster one. The 
correct definition of this force goes back to Newton. The force is 
proportional to the common area of contact S of the two layers and 
to the velocity gradient normal to the direction of flew. Thus if the 
velocity v increases along the positive z-axis, the layer just above 
that for which s --- 0 exerts a tangential force upon an area S of the 
latter of amount 

«* 

In this case the force is in the direction of the positive x-&xis, and 
thus Is to be reckoned positive. The factor of proportionality r t is 
called the coefficient of viscosity. Laminar flow occurs, for example, 
when a viscous fluid flows slowly through a circular cylindrical tube. 
From experience we know that the liquid adheres to the walls; hence 
the velocity increases as we go toward the axis of the tube, and dvjdr 
is negative. Take the element of volume to be a hollow cylinder of 
fluid with radii r and r -f dr. The adjacent layer on the inside exerts 
a force on this cylinder which is along the positive as-axis and amounts to 

— 27 "Jfrgp* 

where l is the length of the tube. On account of the decrease of velocity 
toward the outside, the contiguous layer of fluid on the outside tends 
to retard tho cylinder with a force 

r,_+s^(H- *>(£ + *[*). 

If the pressure at the inlet end of the rube is p x and that at the outlet 
p 0 , then the resultant hydrostatic force on the ends of the volume 
element is 2nrdr(p 1 — p 0 ). In the stationary state corresponding to 
uniform motion the pressure force and the forces of viscosity must be 
in equilibrium. Hence, neglecting higher order terms, 

~ - 2m,Z d - (r = -2wr(p x -- p 0 ). (70) 
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This equation may be integrated at once: 

* (Pi~P o) r , G 

dr 2 pi ~ r' 


(71) 


Integrating a second time, 

?+C\o g r+D. . . . (72) 

The constants of integration C and D are determined in the following 
way: Since v must remain finite along the axis (r = 0). C must be 
zero. Moreover, v must vanish at the wall, where r— a. Hence, 


The solution is thus 


(a 8 - r 8 ). 


(73) 


If the velocity vectors of individual fluid elements are drawn with 
their initial points in a given cross-section, their termini lie on a sur- 
face of rotation whose meridian section is a parabola. In this case 
we speak of a parabolic velocity profile. The total volume W issuing 
per second is obtained by taking the integral j zid8 over a cross-section. 
In this case we have 


W - r 2rrdr = ( 74 ) 

Jq 4 7fi 

This is Poiseuille’s Formula, which states that the quantity of fluid 
issuing each second is directly proportional to the pressure difference 
and to the fourth power of the radius of the tube, and inversely pro- 
portional to the length of the tube and to the coefficient of viscosity. 

Simple laminar flow, however, is not the only form which can 
develop in a tube. At greater velocities the flow breaks down into 
another state where ( 1 ) the velocity of efflux is no longer proportional 
to the pressure difference, but increases more slowly, ( 2 ) the values 
of velocity and pressure at a given point are not constant in time, as 
before, but oscillate about mean values, and (3) the paths of individual 
particles are no longer rectilinear; there is a continual intermixing of 
fluid elements, as may easily be demonstrated by injecting filaments 
of coloured Liquid. This state of flow is called turbulence. The initiation 
of turbulence is determined by the value of a dimensionless quantity 
called Reynolds’s number, after its discoverer. This quantity, for a 
circular tube of radius a, is given by 

3 

V 


(75) 
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where p in the density of the fluid and v is its velocity. There is a 
rather definite lower limit of the value of R corresponding to the 
beginning of turbulence. Turbulent flow has not been produced for 
values of R below about 2200. Whether or not turbulence appears 
for values of R above 2200 depends upon the degree to which the fluid 
is free of disturbances, especially at the inlet. The theory of turbulence 
and of the breakdown of laminar flow is still an incomplete! solved 
problem of hydrodynamics. - 


( b ) Extension to arbitrary states of flow 


In order to calculate the stresses on a volume element of a viscous 
fluid we can avail ourselves of the formulae of the theory of elasticity, 
provided we bear in mind that in place of the displacements u, v, w 
{p. 162), their time derivatives appear, since the viscous forces operate 
only when motion occurs. But these derivatives are merely the com- 
ponents of the flow velocity. Thus we must put into equations (2J) 
(P- 166), 


dVg 

dv. 

dv. ) 

dx 


dv v 

dv 

dv. 

dy 


dv. 

dv. 

dv. 

dz 

* 31 ~ Tx + di~*» J 


We shall assume, in what follows, that the fluid is incompressible; 
hence e 11 -f- e 22 + e 3S — 0. We must now introduce the coefficient, of 
viscosity ij in place of the elastic constants E and a. To see how this 
is to be done, we return to the above special ease of linear laminar flow. 
Let the flow now be along the y-axis, and lot the velocity increase in 
the direction of the positive cc-axia. Thus 

«, = 0, v„ = v, V'=0. . . . . (77) 

The elastic , equations of p. 174 give the shearing stress on unit area 
normal to the x-axis, acting in the direction of the y axis, as 

P — E c — E dv /7V 

2(l+g) ^-211+^) dx' ' * * (78) 

while equation (69) (p. 214) gives 

dv 
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E 

for this quantity. Thus — — - 
2(1 + a) 

is to be replaced by ij, and we have 

p «- 2 ni ™ 

p "- 2 4 ' 

According to p. 170, the components of the resultant of the stress 
forces per cubic centimetre become 

'■- 8 &+T£+'£-*'*-*'i&+^+ , VSr 

32y .' , 82 «. . S*®*\ 

~ V \ 2 W + dy* + dxBy + dzdz + dz*)’ * * * (80) 

or F, == ^(Av* -f- ~ div t>) = ijAv,., ... (81) 

since div = 0. Similarly, 


If these forces are added to the previously considered forces acting on 
unit volume of a fluid, we obtain the additional term rjAv in the vector 
formulation. But by the familiar vector formula 

curl curl v — grad div v — Av, 

the viscosity term rj Av becomes simply —rj curl curl v, since div v= 0. 
Denoting, as above, the external force per gramme by G, the funda- 
mental hydrodynamical equation for incompressible viscous fluids 
becomes 

|^ + v gt&i.v— G — - gradn — - curl curl v. . (82) 

dt p p 

If the stresses due to viscosity are allowed to remain in the equation, 
this expression becomes, using (80), 

^~ + v gvai . v= G — 1 gradp + - (* div Pj + j div P s -f k div P 8 ). (83) 
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(c) Stokes’s Law for the fluid resistance to a moving sphere 

Stokes was the first to succeed in determining the resistance ex- 
perienced by a sphere moving uniformly through a viscous fluid. 
Naturally, it amounts to the same thing if we consider the sphere 
fixed and the fluid to be Sowing by it with the opposite velocity u 0 . 
We are thus concerned with finding a special solution of (82) which 
corresponds to the flow about a spherical obstacle and calculating 
the resultant force acting on the sphere from the flow pattern obtained. 
Stokes assumed the speed to be so small that the expression v grad . v, 
containing the products 

3t>» 

v “ai’ ”* 

could be neglected. That is, in the stationary state — which alone 
interests us — the left member of (82) may be set equal to zero. This 
approximation, however, leads to erroneous results in the case of a 
cylinder, as Stokes himself pointed out. As Oseen has shown, the 
term v grad . v must be taken into account, even for low speeds, at 
least to the extent that it is included in the form v 0 grad . v — i.e. the 
flow zris replaced by the undisturbed flow v 0 except in the derivatives 
of velocity. For the sphere, this furnishes only a small correction term, 
Stokes’s analysis being thus justified, and his result verified. For the 
sake of simplicity, we shall therefore give a brief reproduction of 
Stokes’s mathematics. Since ws are concerned only with viscous 
forces, we leave the external force — the force of gravity — out of con- 
sideration. Its only effect is to produce a hydrostatic buoyancy. Wo 
proceed, then, by replacing all forces not connected with viscosity by 
the hydrostatic forces, since the velocity is low. However, in computing 
the total force, these contributions must not be forgotten. Thus, in 
the problem of a sphere failing through a viscous liquid we must not 
forget the buoyant force. In order to calculate the viscous forces, we 
thus consider the fluid freed from gravity, and the hydrodynamic 
equations reduce to 

grad p — -q&v; tliv v — G. .... (84) 

We try a solution for v which iB the sum of a potential flow v t = grad G> 
and a flow v t not derivable from a potential. Then 

gradp — 7) A grad -f ijAt » 2 — 17 grad A<X> + *?A» a . ( 86 ) 

We further assume that v 2 satisfies the equation Av 2 — 0 . Integration 
of ( 86 ) then gives 


p = ijAfc -f- p 0 . 


. ( 86 ) 
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The condition of mcompressibility, div v — 0, furnishes the additional 
equation 

div grad <D -f divt> 2 — A<X> -f- div© 2 = 0. , . . (87) 


The simplest assumption we can make regarding v 2 is that there is 
only one component different from zero. This quantity must then 
satisfy the equation Av 2 * — 0. The simplest solution of this equation, 
corresponding to spherical symmetry, is 


Thus we put 


.1 


(88) 

(89) 


v 2 = - i, div v z =i ( grad - ^ = a -I. 
r \ r f ax 

According to (87), this gives the following differential equation for <P: 

3 * 

A® -{- a — 0. (90) 

ox 


Wo may verify in a moment that a particular integral of this equation is 



for this gives 



In order to have sufficient constants to satisfy the boundary con- 
ditions, we add to this solution functions which satisfy the equation 
A<S> = 0. Thus these functions drop out when substituted in equation 
(90). Since oux previous particular solutions vanish at infini ty, we must 
add a term which fulfils the condition that the flow at a great distance 
from the sphere is parallel to the x-axis. A suitable term is v 0 x. Another 
function which satisfies AO = 0 is, for example, b/r. But further, on 
account of the interchangeability of the operators d/dx and A, all 
derivatives of 1/r also satisfy the equation A<t> = 0. We shall show 
presently that all the boundary conditions can be satisfied by adding 
the single function 


3 *’ 
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our assumed solution is thus 


A a. A a a rV, 

» = grad V-o*+^--o »-/ + ;*• * 


Written in component form this is 
M 1 


, , r ad*r , a . ,/3as» 1\ a/1 * 2 \ , 

v *= V ° +b dl?~2M + -r~ Va+b \l?-?)-2\r~*)^ 


b o a z r __ , 3*y a *y 

3*3y 2 3*3y r 5 2 r 3 ’ 


* 3*02 2 3*32 r B 2 r 3 ’ 

For r==R all components of » must vanish for all values of *, y, z. 
This is true for the y- and the 2 -components if we put 
. o2P 


Then the coefficient of ** in the *-component vanishes of itself, and 
by putting the remaining terms equal to zero we obtain 
3 R , , v 0 R a 

o = — w 0 --, and b= . . . (94) 

The solution corresponding to the boundary conditions is then 



By equations (86), (90), and (94) the pressure becomes 
3- 

p = Po + v^ = Po~ a, n^ x = Po- y 7?*- • ( 96 ) 

The resultant force acting on the sphere is obtained by evaluating the 
stresses from the velocity components by equation (79) (p. 217 ), and 
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tairiiig the integral of the stresses and of the pressures over the sphere. 
We thus have (cf. p. 168): 

F= — j)pd8+ <f {^1 cos(m)+P 2 cos (nj) + cos(nk)}d$, (97) 

the pressure having the negative sign (see footnote, p. 191). On account 
of symmetry, the resultant can have only an ^-component. The value 
of this is 

F„ — F — — £ pidS 

+ £{P n 008 (***) + I J 21 COS (nj) + P 31 cos (nk) } dS. (98) 

Because of the symmetry of the stress tensor (P 12 — P21)’ we have 
{ P n cos (ni) + P 21 cos (nj) + P 31 cos (nk)] dS — P 1 dS, (99) 
and so the resultant becomes 

P = <£ (Pj _ p £)d8 (100) 

This integral is to be taken over the sphere of radius R. We can save 
much calculation, however, by noting that the divergence of (Pi — pi) 
vanishes. For 

div (Pi — pi) = div P t — 

but this expression is zero, according to our assumptions, as may be 
seen from the component form of equation (83) (p. 217). Application 
of Gauss’s theorem then shows that the integral has the same value if 
taken over a concentric sphere whose radius is allowed to become 
infinitely large. Since the surface element of this sphere increases as 
the second power of r, we can drop all terms of higher order than 1/r, 
e.g. terms like a ; 2 /r 5 , in (92). For r large, the velocity components 
then simplify to 

a . a 7 ? a xu a xz 

V * a = v o + 2^ + 2 V v* — 2 fS 9 v *«> 3=1 2 i 5 ' ^ 95 ^ 

Here a stands for — § v 0 R. In order to calculate the stresses we need 
the derivatives 



( 101 ) 
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Then by equations (79) and (96), 

F = j> (P x - pi)dS 

= — j) ‘da-q ~ ids — <f) Bat) — £ 3ujj ^ kdS — <j) pfidS 

— —3a-;} j) ^ (a?* -f yj + z£)dS — ^ Jv'dS 

— — 3cm; j) ^ r dS — j) p Q id& (102) 

The last integral vanishes on account of the constancy of the integrand. 
In the first one, 

dS — rrsinddddfiA 

J-corf* } « 

so that 

F —■ —Zarj f f cos 2 6 sin 0 dd drf> — -j- — j~ cos 3 fl~| 

Jo do $ |_ Jo 

— — brar} (104) 

Inserting the value of a as given by (94) (p. 220), we obtain the final 
formula 

F = Qir>]V 0 R (105) 

The force is thus directly proportional to the radius of the sphere and 
to the relative velocity of sphere and fluid. This law is of especial 
importance in the study of the atomic nature of electricity (cf. pp. 418 
and 425). The law has been well substantiated for spheres falling with 
low velocities in viscous fluids 


8. Surface Tension of Liquids. 

(a) Definition of the surface tension constant 

A large number of familiar phenomena — for example, those as - 
sociated with soap bubbles — indicate that the surface of a liquid 
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behaves like a stretched membrane, and that work 
must be done to increase the area of the surface. 

What causes this tension? In order to explain the 
cohesion of liquids we must assume that the mole- 
cules are subject to mutually attractive forces which 
decrease rapidly as the distance increases, and are 
therefore of little effect in the gaseous state.* A group 
of molecules shielded from all external forces would 
thus assume the configuration in which the particles are at the mini- 
mum distance apart, i.e. a sphere. Every deformation of the sphere 
enlarges the surface and increases the average distance apart of the 
molecules, thus requiring that work be done against the attractive 
forces. If we pursue these idea's mathematically, and calculate the 
total potential energy of a quantity of liquid, we obtain a part 
proportional to the volume and another proportional to the surface., 
Since the former increases as the cube of the linear dimensions, the 
tatter as the square, the surface phenomena will be more prominent 


when the volume considered is small. 

We begin by considering the surface energy 

....... (108) 

The factor of proportionality a is oalled the surface tmsiem. If we 


mark off a portion of the surface (fig. 6) by means of a closed curve 0, 
then a force r, lym$ in the surface, acts on. each unit length of this 
carve. Let the curve <7 pass into the curve 0‘ when the surface is 
enlarged. Let S n be the displacement of an element of arc ds 
in the direction of the normal to C lving in the surface. Then 
the work done against surface tension m displacing the curve C is 

—-5TT — (j) rds 8n. . , . . . (107) 

Since r has the same value at all points of the surface, as in the case 
of a membrane, it may be brought outside the sign of integration. 
Also, since ds Bn is equal to the increase in area, we have, by (106) 
and (107), 

SU — —8 W ~ rS S — a 88. . . * . (108) 

The surface tension a is therefore equal to the force r acting upon unit 
length of a bounding curve. 


* Gravitations,! attraction must be assumed to exist between molecules, bat it is 
of a rnuoh lower order of magnitude than the true molecular forces, which art electrical 
in nature. Repulsive forces do not come into play until the molecules are very done 
together. It is these forces which cause the molecules to occupy sjwoe. 
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(b) Normal pressure at a curved surface. Differential equation 
of the surface 

We consider first a cylindrical surface (fig. 7) and consider the 
forces acting upon a surface element 
bounded by two generators and two 
curves normal to them. Since the ten- 
sions acting on these two curves lie in 
the surface, they are parallel to the 
generators, and hence can produce no 
resultant normal to the surface. On the 
other hand, the tensions acting on the 
generators form an angle with one 
another, and have components (fig. 7) 
directed toward the concave side of the 
surface. Calling the length of the 
generators ds ly that of the other bounding curves ds 2 , the inward 
component of the force acting on each generator amounts to 

, y, ds i dso dS <x (is 

i-e- Pi == Jr (109« 



For a. surface of double curvature, we bound the surface element by 
two pairs of neighbouring lines of curvature. The lines of curvature 
are characterized by the fact that neighbouring normals to the surface, 
at points along these tines, intersect each other, and that the directions 
of their tangents are those of the normal sections of greatest and least 
curvature of the surface. Since the two families of lines of curvature 
are orthogonal, the above considerations concerning the cylinder are 
valid for each pair of sides of an element of a doubly-curved surface. 
The total normal force is then 



and the inward-directed normal pressure is thus 



(HO) 


The principal radii of curvature R x and are to be considered posi- 
tive if the corresponding centre of curvature lies within the fluid. The 
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normal pressure may assume negative values for concave surfaces, 
i.e. it becomes a tension. 

To compute the equilibrium form of 
the surface we apply the principle of 
virtual displacements. Imagine the surface 
deformed in such a manner (fig. 8) that each 
element is given a displacement Sn in the Fig. s 

direction of its normal. Calling the dis- 
placement positive when directed outward, the work done against 
the normal pressure caused by surface tension — which is equal to the 
increase in potential energy — is 

= jTa (~ + ~) dS8n. . . . (Ill) 

In addition to this change of potential energy there is another change 
connected with the potential of the external forces, in particular 
gravity; for fluid may be transferred to points of different potential 
as a result of the displacement. Since the energy-changes accom- 
panying the virtual displacements are small compared with the actual 
differences of potential along the surface, we may, in calculating this 
change of potential energy, take the values for the undeformed sur- 
face, say per unit mass. Then this part of the energy change is 

SZ7 a = J P d>dS8n (112) 

Altogether we have, for equilibrium, 

817 = jfa dSSn + f pQdSSn 

-f Hi + k) + ^] iSS ’' =0 - ■ <113) 

The displacements 8 n, however, are not entirely arbitrary. They must 
be so related that the volume of the liquid is not altered. But the 
change in volume is given by the integral J dS on, which must vanish. 
Hence the vanishing of the integral of equation '(1 13) does not allow 
us to conclude that the integrand is zero. In order to satisfy the 
auxiliary condition we again apply the method of Lagrange’s multi- 
pliers established on p. 115 for the case of the vanishing of the varia- 
tion of a sum, and immediately applicable to the variation of an 
integral, which is the limit of a sum. Thus we multiply the equation 

SV — J dSSn = 0 ..... (114) 

by the undetermined multiplier A, which we may write under the 

(B711) 
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integral sign, since it is constant over the surface. We then add this 
equation to (113). Now, from the same considerations as on p. 1 16), 
we may conclude that the vanishing of the integral implies the vanish- 
ing of the integrand, whence 


Ki + i) + /><& + A==0 ' * • 


This is the differential equation of the surface of a liquid. The con- 
stant A must be determined from the known values of 1^, R z and $ 
at some point on the surface. If, for example, we consider the rise of 
a liquid surface at the walls of a vessel, A may be given readily for the 
middle point, where the surface may be considered 
H plane. It is also convenient to reckon the gravita- 

tional potential from this point, so that A ~ 0 in this 
||<x:y case. 

H (c) Boundary conditions 

In order to integrate the differential equation, the 
boundary conditions must also be known. The usual 
case is that in which the liquid is confined in a vessel 
whose walls intersect the surface in a boundary curve. 

Fig. 9 Fig. 9 represents a section normal to the bounding curve. 

At the surface of contact between the liquid and the wall 
there exists a surface stress a lf whose magnitude depends upon the 
nature of the liquid and of the material composing the wall.* For 
equilibrium, the component of the surface tension a parallel to the wall 
must balance the boundary surface stress a lf . No condition of equili- 
brium need be satisfied by the normal component, for such a com- 
ponent manifests itself solely by a force against the. wall; on the other 
hand, if the tangential forces wore not in balance, the hounding curve 
would move along the wall. In the case, where the liquid wets the 
wall, the acute angle between wall and liquid surface (fig. 9)— the 
so-called angle of contact — is denoted by 8. In this case 

o cos 8 -f- a x1 = 0 , (116) 


This means that c nf is negative in this case. If 


of the ooosUat & Introduced above really is contingent wpou 
above the liquid. However, the difference* for various gases i 
speaking, a is the surfaoe teoaion of the liquid with respect to 
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fchtffiS exists no finite angle of contact, and tlie liquid spreads along 
the wall in the form of a thin layer. This is the case for the contact 
between water and glass. It corresponds to the boundary condition 
5 = 0. The spreading takes place even along a vertical wall. For a 
thin film, the work done against gravity is so small that it sets prac- 
tically no limit to the spreading of the liquid. 

(d) Liquid films 

A liquid, film (e.g. the wall of a soap bubble) is bounded by two 
parallel surfaces which are close together. Consequently, the normal 
pressure of a curved film is twice that corresponding to each free 
surface. In the interior of a spherical soap bubble of radios E—R^ — 
the excess pressure is thus 

r-T ™ 


If a bubble of this size is to be in equilibrium, the internal pressure 
must exceed the outside air pressure by this amount. For an unclosed 
film, the total pressure on both sides is the same; hence the part due 
to the existence of surface tension is zero, and the differential equation 
of an unclosed film is 


( 118 ) 


The expression on the left is called the mean curvature. Surfaces for 
which this quantity vanishes are called minimal surfaces, since they 
are the surfaces of smallest area for a given boundary curve. It is at 
once evident that a stretched skin, corresponding as it does to a liquid 
film, assumes a form having minimum area. 

Mx. S3. Calculate the height to which a liquid will rise in a narrow tubs of 
radius r-j, whose walls are wet by the liquid (ooe0 =» 1). 

£x. 64. A soap film is stretched between two equal circular wire rings lying 
in parallel planes, their centres being on a line normal to these planes. The 
resulting surface possesses rotational symmetry about this line. What is the 
•form of the surface? 



CHAPTER X 


Relativistic Mechanics 

I. Space and Time in Newtonian Mechanics. 

When we spoke of the “ path ” and of the “ velocity ” of a particle 
in Chap. V we tacitly assumed the existence of a frame of reference 
(co-ordinate system) in which we could specify the position of the 
particle from instant to instant by giving certain numbers, and further, 
a means of measuring time (a clock) which marked off definite equal 
intervals of time at which the position could be recorded. A co-or- 
dinate system may be given, for example, by the walls of a room, or 
by the positions of the stars and the direction of the plumb-line. For 
a measure of time, we may use any subdivision of the day, i.e. the 
period of rotation of the earth. Actually we find the laws of me- 
chanics, as derived in the preceding chapters, verified when we use 
such reference frames and measures of time, at least to a very good 
approximation. But there are also frames of reference in which the 
fundamental law 



is not valid. For example, if we set our room, along with its fixed 
co-ordinate system, into rapid rotation, and if we allow the given 
applied forces to remain unchanged, then the paths of particles in the 
rotating co-ordinate system will be different. We can restore the 
validity of Newtonian mechanics by adding centripetal forces corre- 
sponding to the rotation to the other impressed forces. But what is 
the cause of these inertial forces? Newton gave an exact answer to 
this question. Despite the fact that his solution is based upon a fiction, 
it constitutes a foundation of the mechanics of macroscopic masses 
which has served for more than two centuries. Newton assumes that 
there exists a cosmic substratum, absolute space, which “in its own 
nature, and without reference to anything external, remains always 
similar and immovable ”. Newton continues by sayiDg that relative 
space, on the other hand, is ** some movable dimension or measure of 
the absolute space which our senses determine by its position to other 
bodies, and which is commonly taken for immovable space Thus 

* Principle, Book I, Scholium to Definition VUI. English translation by Andrew 
Motte, New York, Daniel Adee, 1846, p. 77. 
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the Second Law holds only for reference frames which are at rest in 
absolute space or, as we shall see presently, are in uniform rectilinear 
motion with respect to it. The centripetal or centrifugal forces 
involved in rotation are caused by an acceleration of our reference 
frame with respect to absolute space. Thus space, which according 
to Kant constitutes, together with thno, the perceptual form of human 
cognition, attains physical reality, and we can perceive its effect in 
the invalidity of Newtonian mechanics for accelerated frames. The 
newer views, on the other hand, lead to a negation of absolute space 
such as the ether of electrical theory. These more recent ideas recog- 
nize only relationships between parrs of matter occupying space. 
The cause of centrifugal force is taken to be acceleration not with 
respect to absolute space, but with respect to the material bodies of 
the universe. Thus, if all the stars were removed, centrifugal force 
would no longer exist. Newton also makes a similar assumption con- 
cerning the second element in the form of our knowledge, viz. time: 
Absolute time is assumed to flow uniformly, regardless of the exis- 
tence of any thinking being or any phenomenon or process suitable 
for measuring it. We see at once that this fiction of an absolute time 
also fails to stand strict examination. We need only propound the 
question: what happens if the velocity of all physical and chemical 
processes, and hence of our thought processes also, is suddenly doubled? 
Since we would have no means of detecting this, the absurdity under- 
lying the very formulation of the question becomes apparent at once. 
Hence the concept of absolute time, which assumes that a standard 
clock exists somewhere in the universe, loses its justification, for it is 
contrary to the spirit of physics to form concepts which are neither 
directly nor indirectly accessible to our senses. Nevertheless, the fact 
that Newton was able to formulate a thoroughly adequate foundation for 
the mechanics of macroscopic bodies by introducing these two concepts, 
whose difficulties could scarcely have escaped his notice, is to be regarded 
as an outstanding stroke of genius. 

% Inertial Frames. The Galilean Transformation. 

We start with the empirical fact that there exist reference frames for 
which computations based on Newton’s Second Law are in complete 
accord with experiment. Consider, for example, celestial mechanics, 
which employs a stationary reference frame at the centjA of gravity 
of the solar system. We do not now say with Newton that this frame 
is at rest in absolute space, or that it is in uniform rectilinear motion 
with respect to it (see below); we content ourselves merely with giving 
such a frame in which Newton’s laws are valid a name. Because cf 
the validity of the Law of Inertia, we call such a system an inertial 
frame. Evidently the confirmation of our calculations depends also 
upon a reasonable measurement of time. How may we obtain a cri- 
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tenon as to whether our frame of reference, together with our clock, 
represents an inertial system of space and time? For this purpose we 
perform, at least in thought, the simplest of mechanical experiments — 
rectilinear motion of a particle subject to no forces. If we divide the 
line of motion into equal segments, we can take the time between the 
passing of two successive marks as the unit of time. However, one 
direction is not sufficient. If our measured path were accelerated with 
respect to a true inertial frame, we would obtain a non-uniform clock 
which would give impossible results for other experiments. It is readily 
seen that the necessary and sufficient condition for an inertia! frame 
is that three particles projected in non-coplanar directions describe 
straight, paths. Then, by dividing the path of any one of the particles 
into equal intervals, we can obtain an inertial measure of the time. 

We now prove that if S is shown to be an inertial frame, Newtonian 
mechanics is also valid in every co-ordinate frame S' moving uniformly 
in a straight line with respect to S, provided that the same measure 
of time obtains in both systems. We shall find later that this tacit 
transfer of the time unit to the moving system is precisely the weak 
point of these almost self-evident considerations. Let the origin of 
2' move with the constant velocity u. Then if we denote by accents 
all quantities referred to the moving frame, we have 


r — v' + ut. . (1) 

Differentiation with respect to the time gives 

r-r'- vu. (2) 

r--= r (3) 


Equation (2) states that the “ absolute ” velocity r is equal to the 
vector sum of the relative velocity r‘ and the velocity u. This theorem 
is general, since no use was made of the constancy of u in deriving 
formula (2). Equation (3) states that the acceleration is the same in 
both systems, so that 

F — mr — mr', ...... (4) 

which was to be demonstrated. The transformation r~r f + iM 
which, together with the equality of the time reckoning, corresponds 
to the concept of absolute space and time, is called the Galilean trans- 
formation. If we take the i- and the e'-axes in the direction of the 
velocity of motion of S', whose magnitude is u, and if the two foames 
coincide at time t— 0, then the co-ordinate representation of the 
Galilean transformation is 

x j=b %' -j- ut, y—y', z = 2 ', t~(. . . (1 ') 

The validity of the Galilean transformation carries with it the 
.impossibility, by means of mechanical experiments, of detecting 



X-] 


RELATIVISTIC MECHANICS 


231 


uniform rectilinear motion with respect to absolute space; e.g. we 
could not determine the earth’s orbital motion, which is approximately 
of this nature over a short interval. However, the extension of these 
ideas to electrodynamics (optics) shows that an experiment is possible, 
by means of which the reality of absolute space (the ether) might be 
verified. But experiments designed to test this point yielded negative 
results (see p. 239); this led to a revision of the previous conceptions 
of space and time. 

3. Accelerated Frames of Reference. Free Fall on the Rotating 
Earth. 

We now consider accelerated motions of the frame S' with respect 
to the inertial frame E. Two special cases of particular significance 
are selected for discussion. 


(a) Reference frames in uniformly accelerated rectilinear motion 


We assume our reference frame E' to move with uniform accelera- 
tion a in the direction of the negative &-axis. Let this frame coincide 
with E at the time t = 0, and let it start from rest. Such a frame is 
represented by a falling box. The connexion between the quantities 
pertaining to E and to S' is then 


r=r’-a~/g, (5) 

r~r' — atk , ....... (6) 

r—r'— ak. ....... (7) 


In E we have F — mv — mv 1 — mak\ hence for the same impressed 
forces in S', 

mr' = F + mah. (8) 

In addition to the resultant of the impressed forces we now have an 
apparent force or inertial force mak. If a is identical with the accelera- 
tion of gravity, the inertial force exactly balances the gravity force 
—mgk. Hence the force of gravity does not operate in the falling 
chamber. On the other hand, if we imagine the chamber to be at a 
place where there is no gravitational force, and if we move it with 
uniform acceleration g in the direction of the +&-axis, then an apparent 
force of amount mg, directed along the negative ^s-axis, acts upon 
every body within the box. Thus an observer within the box would 
say that he is in the gravitational field of the earth, This equivalence 
of an accelerated frame of reference and a gravitational field is of 
fundamental importance in the general theory of relativity (cf. p. 258). 
The equivalence can be maintained, however, only if the ratio of in- 
ertial mass, i.e. the mas3 entering in Newton’s Second Law, and gravi- 
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iational mass, i.e. the mass -occurring in the law of gravitation, is the 
same for all bodies. In this case the ratio may be set equal to unity; 
this determines the numerical value of the gravitational constant. 

Following this preliminary glimpse we turn to the other important 
special case. 

(6) Uniformly rotating frames of reference 
In the previously considered types of motion the unit vectors *', 
k’ maintained their directions, and we assumed these directions to 
coincide with those of i, j, k respectively. For a rotational motion of 
S', however, the directions of these vectors change. If we suppose 
the origins of the two frames to coincide, and if we take the axis of 
rotation through the common origin, we have 

r — xi + yj + zh = *T + y'f + z'k'. . . (9) 


In taking time derivatives, the unit vectors will thus be variable: 


. dx’ 




f*' + *% + *%+< 


dk' 

IP. 


(10) 


The first three terms represent the velocity relative to S', the last 
three give the velocity of a point rigidly attached to S'. If we denote 
the angular velocity vector by to, then 


f % =wi • an 


If these values are substituted in equation (10), the last three terms 
may be combined: 


x' [to*'] + y [to/] + z' [to£'] = [to (a:'*' + y'j' + z’k-')] = [toy]. (12) 


If we denote differentiation referred to S' by d'jdt, then (10) may be 
written , ,, 

-£ +(“’’) ( 13 > 


This is true for any vector, since we can insert the components of any 
vector whatsoever in place of x, y, z. Symbolically, we may thus write 


- S+m- 


dt dt 


* (14) 


Differentiating (13) again with respect to * and applying the above 
rule, we have, remembering that to is constant, 


c Pr 
dt* 
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° r £-5^+ 2 • • < I6 > 

The first* term represents the acceleration relative to S', the third 
represents the acceleration in the system S of a point fixed to S'. 
However, it is no longer true, as in the case of velocity, that the ab- 
solute acceleration is equal to the sum of these two terms; there is, 
in addition, a third term — twice the vector product of angular velocity 
by relative velocity. This term, which occurs only when the particle 
moves in S', is called the Coriolis acceleration, after its discoverer. 
The Second Law then becomes 

¥ = m % = m IP + 2m w\ + m [» ["'ll < 17 ) 

or, in the moving system, 

rn ~ = F — 2m [w ^ j — m [u> [cor]]; . (17') 

that is, we have, in addition to the impressed forces, the ordinary 
centrifugal force —m [to [tor]] and the Coriolis force 



As an example of the forces arising from rotation of the frame 
of reference we take the case of free fall on the rotating earth. Co- 
ordinate frames rigidly attached to the earth are, strictly speaking, 
not inertial frames, on account of the daily rotation. The angular 
velocity of the orbital motion, on the other hand, can be neglected, 
having to the other the ratio 1 : 365 . However, the deviation from a true 
inertial frame is so small that it usually may be neglected. Neverthe- 
less, there are various mechanical experiments which permit the rotation 
of the earth to be demonstrated. These experiments were formerly 
interpreted as proof of the rotation with respect to absolute space; 
from the point of view of the present day, however, they may be 
considered merely as evidence of rotation with respect to the other 
matter of the universe. Take an inertial frame whose yfe-axis is in the 
direction of the earth’s axis. Let the frame S' be oriented as follows: 
Take the £'-axis in the direction of the plumb-line at the geographical 
latitude <j>. This direction is that of the resultant of gravity and centri- 
fugal force. This direction will not pass through the centre of the 
earth if we consider the earth to be spherical, but it is sufficiently 
accurate to take the angle between this direction and the earth’s axis 
to be . . 
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We also emit from consideration tlie polar flattening of the earth. 
The positive sense is taken to be that pointing outward from the earth’s 
centre. The ai'-axis is directed south- 
ward, the ./'-axis then points eastward 
(fig. 1). We then have the equation of 
motion 



d*r_ 
n dt* 


-mgk' — 2 m 


J srj- 


(18) 


The centrifugal force is already included 
in the term —mgk', since we understand 
g to represent the resultant acceleration 
riB - 1 of a falling body at the point considered. 

The angular velocity to has the com- 
ponents —o>i' cos $ and -\-tok' sin <f> in the frame S'. In co-ordinates 
we thus have 




dV „ 

n w -2m 

n » —2 mu> sin <j> 


~s 


--2mw cos <f> -s 


—mg -f- 2 m<o cos <f> 


<W 


• ( 18 ') 


Leaving the rotation of the earth out of consideration, the and 
/'-components of velocity vanish for free fall. Hence, even when 
the rotation is taken into account, the quantities dx • jdt and 
dy'jdt are small compared with ds'jdt so that we have the sim- 
plified equations 


dh? 
dt 2 


0 


dy _ 

dt* ~ 


— 2c*> cos<£ 


d*_ 

dt 


. • • . ( 19 ) 


#z' 
dt 2 = 


The first of these equations shows that no deviation occurs 
in the north -and -south direction. Integrating the third equation 
and taking 
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there results — — gt, 2 ' = — (20) 

Inserting this value in the second equation, 

^ = +2 a>gt cos <j>. ...... (21) 

Integrating twice and taking 

we obtain y' = ^wgi 3 cos<f>. ...... (22) 


This result~an eastward deviation of falling bodies, proportional to 
the cube of the time of fall — agrees with observation. 

Ex. 66. The Foucault pendulum . — Calculate the motion of a freely swinging 
pendulum bob suspended by a thread, taking into account the motion of the 
earth. (The 2 - component of the velocity may he neglected here.) 

Ex. 66. Using equation (14), express the total change in angular momentum 
of a rigid body in terms of one part relative to the body and another due to rota- 
tion of the system of co-ordinates fixed in the body. Using the result, write the 
equation M = dpjdt in terms of components of co resolved along the principal axes 
of inertia. These equations are called Euler’s equations. 


4. Moving Frames of Reference in Acoustics. The Doppler Effect 

Before discussing the difficulties which the concept of absolute 
space raises in optics, we shall consider the analogous questions for 
sound. Here the relations involved are perfectly clear and the effects 
are quite definite, on account of the fact that the velocity of sound 
in air is not too great compared 
with mechanically attainable 
speeds. We know that the velo- 
city of propagation of sound in 
air is c — 333 m./sec. In a co- 
ordinate system fixed with respect 
10 the source of sound (taken to 
be at rest) and with respect to the 
air, the sound will have arrived at 
a sphere of radius c after one |r ‘ s * 

second. Let us now lay out a 

measuring rod and move it relative to the air.- We then measure the 
relative velocity c r along the moving measuring rod, the direction 
of c r being given by that of the rod. Now the absolute velocity must 
always be c; hence if the velocity of the measuring rod is given, in 
direction arid magnitude, by V, 

c == c r + v or c r — c — v (23) 



In particular, if th9 rod is moved along its own length (position 6, 
%-2), c r — o — v", (23') 
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for motion normal to its length (position e, fig. 2), 

o r = Vc 2 - v 2 (23”) 


Moreover, we know that motion relative to the source of sound is 
accompanied by a change in pitch of the tone perceived (Doppler 
effect). First imagine the source and the observer both at rest. The 
source emits v a condensations per second; these follow each other at 
distances apart \ — cjv s . The succession of sound waves may be 
thought of as a long rod moving forward with the speed c, the con- 
densations being located on it, like a series of beads, with a separation 
equal to Hence v u — c/A 0 = v s is the number of condensations 
passing the position of the observer each second. If we now permit 
the observer to move toward the source of sound, then he receives an 
additional number of vibrations per second equal to the number of 
waves in the distance he covers per second, i.e. a number v)\. Hence 



For motion away from the source the positive sign in the parenthesis 
is to be replaced by the negative. 

The Doppler effect due to the motion of the source is essentially 
different, although the effect is in the same direction, i.e. the pitch is 
raised when the distance between source and observer is decreasing, 
and lowered when the distanoe is increasing. The situation may be 
analysed in this way: If the source were at rest, then the first dis- 
turbance — say a condensation — would be at a distance Af, from the 
source after the time of one vibration had elapsed. But the souroe 
has moved on a distance vjv — ( vjc)\ in this time, thus decreasing 
the distance between the source and the first condensation by ( vjc)\ . 
This decrease holds for all distances between condensations; for after 
another vibration, the first disturbance has progressed a distance Aq, 
but the second starts from the new position of the source, and so the 
distance between the first and second condensations, is again decreased 
by {vjc)\, and so on. Fig.- 3 shows how this oecurs. The number of 
vibrations passing the observer each second is thus 


c o 

A' “ Ao(l - v/ef 



If the source moves away from the observer, the positive sign must 
be used before the second term in the denominator. Comparing this 
result with the case where the observer alone was in motion, we see that 
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bhe two are equivalent only to a first approximation, i.e. when higher 
powers of vjc can be%eglected. The reason is that the medium carrying 



the sound (the air) provides a special co-ordinate frame S to which 
all motion is to be referred. One sees at once that the two cases are 
essentially different by taking v = c. In this instance we have v M ^= 2v s 
for motion of the observer, while if the source is moving, 1 ^ = 00 . 

5. Moving Frames of Reference in Optics. The Michelson-Morley 
Experiment. 

According to the older ideas, absolute space — called the ether in 
electrical theory — is the carrier of electromagnetic and therefore of 
optical phenomena. It is therefore natural to apply the results ob- 
tained for sound to optics. Actually, however, this procedure leads 
to unsurmountable difficulties. The existence of an optical Doppler 
effect of approximately the magnitude given by equation (25) has 
been observed in the spectra of stars moving toward or away from 
the earth, as well as in the light from canal (or positive) rays — fast- 
moving luminous atoms (cf. p. 441). If we could increase the velocity 
of the light source, or of the observer, or increase the accuracy of 
measurement to the extent that quantities of the second order in vjc 
could be measured, then we would be able to test the existence of 
the ether. As we saw in Acoustics, if we have a mediurri at rest, the 
second-order quantities show that the Doppler shift depends not only 
upon the relative velocity of source and observer, but on the velocity 
with respect to the medium. Unfortunately, there is little hope of 
securing an experimental decision concerning the reality of the ether 
concept in this way. Only recently has it been found possible to in- 
crease the accuracy of Doppler-effect measurements on positive rays 
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to the extent that second-order quantities can be measured (see p. 246), 
It is easier to carry out, with sufficient accuracy, an experiment by 
which the velocity of propagation with respect to a frame moving 
through the ether ie investigated. Since light is propagated in the 
ether with velocity c, the velocity of light relative to a system moving 
through the ether with velocity v is c — tr. A direct measurement or 
comparison of the velocity of light in various directions on the earth, 
which (considering only the orbit about the sun) moves uniformly 
with a velocity of about 30 kin. /sec., is again, it is true, beyond experi- 
mental possibility. A comparison is made possible, however, by using 
the light itself, in a somewhat different arrangement, to measure the 



v anishing ly small time differences. This scheme, which had already 
been suggested by Maxwell, was first utilized by Michelson and Motley 
in 1880. It has since been tested many times by various investigators. 
In this arrangement (fig. 4) light from the source L is divided into two 
beams, normal to each other, by the half-silvered plane parallel glass 
plate <?. Each beam covers the distance l to the mirrors \ or S z , and 
is reflected baok over its original path, Assume the arm 1 to be in the 
direction of the earth’s motion; then the relative velocity of the light 
is c — v on the initial path and c + v on the return. Hence the time 
taken to pass to and fro along this arm is 


‘.= 0 4 i +4-.=f( I+ ^ + "-> • • < 26 > 


I n the direction normal to this path the relative velocity of light is 
V c 2 — u®; hence the time required for the double journey along the 
arm 2 is 
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Thus there exists a difference— a 
times: 

At — ty — 


very small one — between the two 



If we take the orbital speed of the earth to be 30 km./sec. and let the 
path l be 30 m, long, At — 10~ 15 sec. In spite of its smallness, this 
difference can be detected if we use the light vibrations themselves 
instead of a material clock for this measurement. Now the number 
of vibrations N described by yellow light (A — 8 X 10~ 5 cm.) in 10~ 15 


At 


= 10- 1 


3 . I0 10 


■ 6710- 




(29) 


The difference is thus one half of one vibration. If the apparatus is 
turned through 90° the two arms interchange their rdles, and the 
difference has the opposite sign, so that the entire difference for two 
positions 90° apart amounts to one complete vibration. Since both 
partial beams originate in the same light source, they interfere when 
reunited (see p. 410) and we obtain a system of interference fringes 
which may be thought of as “ curves of equal thickness ” in the wedge 
formed by S t and the image of S 2 produced by reflection in the semi- 
silvered plate. Starting with the arm 1 in the direction of the motion 
of the earth, we have a definite system of fringes. If, now, the appa- 
ratus is turned through 45°, the two arms are symmetrically disposed, 
and the original time difference of T/2, corresponding to a phase 
difference of n, vanishes. The places which were originally dark are 
now bright, i.e. the fringe system has been displaced by half a fringe 
width. If we turn the apparatus further in the same sense, the opposite 
difference obtains, and when the rotation amounts to 90° the dis- 
placement is an entire fringe width. This is, of course, subject to the 
assumption of the numerical values adopted above. The result of the 
Micheison-Morley experiment, conducted under approximately these 
conditions, was the absence of any such fringe displacement. When' 
in 1925 , D. C. Miller reported a positive effect corresponding to a 
velocity of 10 km./sec., verifications were attempted by various in- 
vestigators. Joos, working with improved apparatus at the Zeiss 
plant in Jena, was able to show that if a positive effect does exist, 
it must be smaller than one thousandth of the width of a fringe. Fig. 1, 
Plate I, shows a record of the interference fringes. This record was 
obtained by moving a photographic plate in front of a 0-2 mm. -wide 
slit mounted normal to the fringes, the entire apparatus being rotated, 
meanwhile. Instead of a sinuous line repeating after one half turn, 
we see that the stripes are perfectly straight. Experiment ha,s thus 
decided against our acoustic analogy and against the existence of a stationary 
medium carrying light; i.e. the existence of a cosmic ether or absolute 
space is disproved. 



240 MECHANICS [Chap. 

6. Tits Relativistic Conception of Space and Time. The Lorentz 
Transformation. 

Disregarding hypotheses which are contrived ad hoc, and which are 
therefore unsatisfactory, the result oi the Michelsou-Morley experi- 
ment leads compulsorily to a revision of oui concents concerning space 
and time. The negative result oi this experiment makes it very evident 
that absolute space is a fiction. But absolute time must also be aban- 
doned. The definitions occurring in the treatment of time by the old 
mechanics are imperfect. For example, what is the meaning of the 
statement: “ Two events took plaice simultaneously at A and B ”? 
If vpe postulate an absolute, uniformly -flowing time, there can be no 
doubt as to the meaning of this statement; but phyrics demands an 
experimental criterion of the correctness of this statement, and this is 
where the difficulties begin. We must have synchronized, clocks at 
the stations A and B which show the absolute time. Then the events 
may be said to be. simultaneous if the clocks indicate the same time at 
the instant when the, events occur. But hew can the clocks be syn- 
chronised? An arrangement which suggests itself is the following: 
Place two mirrors, inclined at 45° to the path AB, at the mid -point G 
of AB, sc that an observer at C can see the two stations A and 3 
simultaneously. We shall say that the two clocks are synchronized 
if they indicate the same time at every instant, when seen at O'. But 
from the standpoint of absolute concepts, they will not be exactly 
synchronous even in this case, tor the experiment is performed on the 
earth, which moves relative to the ether; hence the light by which 
we see the clocks will require a time lf{c — v.) to come to its from one 
clock, and a time ?/( c+t>) to come from the oilier, assuming the path 
SB lot length 21) to be in the direction of the earth’s motion. What 
value is to be put for vl There is certainly a velocity of the solar system 
relative to the fixed stars to be added to the orbital velocity of the 
earth, and probably a very large velocity of the galaxy with respect 
to the extra-galacric systems. From the absolute point of view, how- 
ever, there may also be a common motion of all matter with respect 
to the ether. The result of the Michelson-Morley experiment relieves 
us of the necessity of answering this question, for it shows that the 
velocity of fight is me same in all directions on the moving earth. 
Therefore the above experimental criterion for fche^syaohronization of 
the clocks of a system remains, intact. Bat, as was first recognized by 
H. A. Lorentz, the result of the Mieheison-Morley experiment requires 
that two systems in relative motion have two different time measures, 
for it is observed in each system, that light reaches a sphere of r&dius 
c after one second. Hence the Galilean transformation, which sets 
l' ~ t. cannot be correct. Its insufficiency was not recognized sooner 
on account of the fact that the velocity of light is so great in com- 
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parison with all meehardeai velocities that the deviations can be de- 
tected only by the most refined optica} measurements. The fact that 

she Galilean fransformation is . incompatible with the result of the 
Michelson-Morley experiment may be seen by applying the trans- 
formation to a spherical wave observed in & system at rest. The result 
is the equation of a sphere eccentric with respect to the origin. We must 
therefore transform f, along with the other quantities, in such a way 
that the transformation formula? give a spherical wave for the propa- 
gation of light in every frame of reference, i.e. the identity 

SB* + f 4- a s — c 2 * 2 3 s' 2 4- y 2 -j- z' % — c¥ a =. 0 , (30) 
must b9 satisfied. 

For the sake of simplicity wo take the *-axis in the direction of 
the relative velocity of £ and S', thus permitting the z-uxis to coin- 
cide with the s'-asis. We then make the 
obvious assumption that the ferae averse co- 
ordinates remain unchanged. Thus the iden- 
tity reduces to 

X 2 — c¥- 3 sc' 2 — cH' % — 0. (30') 

Since there is but one relative velocity v, 
the transformation formulae mart further fulfil 
the condition that O' has the co-ordinate vt 
in S, and that 0 has the co-ordinate — vt' in S' (fig. 6). The 
simplest relations which accomplish this end are 
a/=*k(x~vt) 

and x— u') j’ ' ' 

We eliminate x' from the second equation by inserting its value 
obtained from the first, and find 

< 32 > 

Putting the values of a:' .and f in the right member of (30'}, we have 
k 2 — c 2 * 2 -- £*( x 2 -- 2xi tf -f vH 2 ) 

+ **[* ~ ? 0 ■ " a) + t( l ~ hf\ • «■ ™ 

The identity requires that the coefficients of x s , xl and t 2 vanish sepa- 
rately; to accomplish this, we have only the two constants k and k‘ 
at our disposal. The coefficient of x 2 yields 


o\+— vt - 
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The coefficient of fi requires that 

-c 2 __ kW + Fc 2 = 0 or k 
The coefficient of xt gives 

(v 2 - c*)kk‘ + c 2 = 
or, after inserting the value of h, 

V - l 

v ( i - w 

It is now easy to verify that these values of k and h' make the co- 
efficient of sc 2 vanish also. 

We thus have the formulae of transformation 




If we let o tend to infinity we obtain in the limit the Galilean trans- 
formation, which is a first approximation to the Lorentz transfor- 
mation for finite v. The ratio vjc will, as is customary, be denoted by 
ft in what follows. Using (34), we readily see by differentiation that the 
speed dx/dt of O' with respect to 2, and the speed dx’jdt of 0 with 
respect to S', are both equal to v. 


1 

a/( 1 - t-Vc *)’ 
= o, 


7. Immediate Consequences of the Lorentz Transformation. 

(a) Relativity of simultaneity and of the order of events 

Let two events occur simultaneously at the points ac* and ® a in 
the frame 2, i.e. let ^ = t 2 . Then 


h c 2 x i h -2 X » 

h “ VT-W ** = 


. . (35) 


i.e. If and tf are different. Two events at different places and x a 
which are simultaneous for an observer at rest in the frame 2 no longer 
appear so to an observer moving relative to 2. We ask further: 
“ May even the order of two events depend upon the state of 
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motion of the observer?” Let us take t 2 > i.e. L — <j > 0. Then 


V - h' 


- h ~ | (*2 ~ *1) 
VI - p ' 


. (35') 


In order to obtain a real time difference t 2 — V at all in the accented 
system, the velocity v of 2' must b» less than c; t 2 — certainly 
remains positive as long as 


Now we shall see below that the greatest velocity with which an agency 
connecting a cause and an effect can be propagated is c. If there were 
a more rapid process, we would use it in place of light to define simul- 
taneity. Hence, as long as c is the greatest velocity attainable, the 
order ol two causally connected events cannot be reversed. However, 
the order of two events which are not causally connected may be 
reversed consistently with the above relationship. 


(b) The Einstein time dilatation 
Let a clock at 3\ give signals at intervals 
A t=t 2 — ij. 


Seen from the moving system, the interval will be, by equation (34), 


A t' = 


t 2 — t x __ A l 

vr- ~jp vr-w 


(36) 


Thus the interval A t appears to the moving observer to be lengthened. 


(c) The Lorentz contraction 

We perform a measurement of length by laying a measuring rod 
alongside the segment to he measured and reading the interval be- 
tween the division marks which coincide at a given instant with the 
ends of the length to be measured. As long as the measuring rod and 
the interval to be measured are at rest with respect to one another, 
this definition is trivial; not so, however, if there exists a relative 
velocity v between the two. Let the distance l — x 2 — between the 
ends of the interval be measured by means of a linear scale at rest in 
this frame. According to equations (34) (p. 242), a moving observer 
will note 
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The times % and i 5j, however, are cc be determined in such a wa; y that 
coincidence with the scale divisions occurs at the same instant in the 
observer’s system, i.s. we must have, not t t ~ i-, but — t x '. From 
the Lorentz transformation we have, for 2*' i,% 

h h ““ \®s ~ ®i) 

and so V — x t ' — xA (z s — a^J-v/T — "i§*. - . . (SS) 

Seen by the moving observer, linear dimensions are reduced in the 
ratio Vl~ £*: I. A body which appears to be spherical to an 
observer at rest relative to it will thus appear to a moving observer 
to be an oblate spheroid. 


(d) Einstein’s velocity addition theorem 

In Newtonian mechanics, relative velocities are found by simple 
addition or subtraction (of. § 2, p. 229). The composition of velocities 
in a mechanics based on the Lorentz transformation is more involved 
Let a frame S' more relative to 51 in the direction of the ar-axis with 
velocity v. Let a point in S' have the velocity 

u’ = 4- <f 4- 

wk “* ■ ■ • <*» 


What are its velocity components u m ii„ in 2? It follows from the 
Lorenta transformation, by differentiation, that 


dx' At! dt' 11 _ 

it 'dt ' ' 'dt , a: _ 1 e® ‘ 1 
41114 '<& “ 

(«■’ 4- »){i — 

« a « 


W* — - 


« ,* + O 


(AO) 


Solving for w* 


(41) 
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and 


Similarly' 


dy = dff ay dt[ ^ B, y 1- 
<& dt dt' dt vuJ 


.VI -fr 


. (43) 

• (44) 


Oxi account ox the denominator of (41), the sum of two velocities which 
are separately less than c can never exceed the velocity of light. The 
Einstein velocity addition theorem gives the simplest explanation of 
the index of refraction of moving bodies (the Fizeau Effect, p. 477). 

Despite the fact that the velocity of light appears to be an upper 
limit for velocities, it nevertheless eennot be denied that velocities 
greater than c are known. In optics, for example, there are media 
with index of refraction n < 1. This means that the wave velocity « 
is greater than c. But signals can be given only by pulses, or groups 
of waves, which are propagated with the so-called group velocity (see 
p. 65). Semmerfeld has shown that this velocity is less than or equal 
to c for all media. Another example of such a geometric velocity ex- 
ceeding c is the following: Let a long ruler make a very small angle 
6’ with the as-axis of a rectangular system of co-ordinates. If, now, 
the ruler be moved in the ^-direction with a velocity v, the point of 
intersection with the ic-axie will move with a speed vjt&n 5. It is 
evident that this velocity can be made to exceed c- by taking 8 suffi- 
ciently small, yet such geometric velocities in excess of c can never act 
as connexions between cause and effect. 


(e) Doppler effect and aberration 

Let a source of light L', at rest in the frame X', send out a spherical 
wave in this frame. Let the wave be 
represented by 

= . , (45) 

where all quantities observed in 2' are 
denoted by accents. Let this spherical 
wave meet an observer P at rest in a 
system S. Let the co-ordinates of P be 
x and y in 2 and as' and y in S', taking both the as- and 
x'-axis in the direction of the relative motion of S' with respect to 2 
(fig. 7), In 2', let the ray from L’ to P make an angle 6' with 
the as' -axis, so that 

r' — as' cos 8’ -f- y‘ sind' 




and hence 


S' = 
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(46) 


4. gSmW-fcf cos«'+y linOW+tf 

The observer in £ interprets the. approaching wave surfaces as coming 
from a point-source of light belonging to his system, so that the re- 
lation which holds for him is 

S ~ - 3 2w(t-(*co>9+ysi;i«)/c)'i-«S j .... (47) 


Moreover, the co-ordinates and the time in the two systems must be 
connected by the Lorentz transformation. If we employ these re- 
lationships to express the accented quantities in terms of the un- 
accented ones, then comparison of S with S' must yield a relation which 
is valid for every co-ordinate and for every time. This is possible 
only if the coefficients of x, y and t in the exponents are separately 
equal. Thus from 


~( 


(x — vt) cos O' y sin#' | 

■yT~]p ~ TvT-^ c ~ ' j 


= 2m < 

b obtain, after reduction. 


, 1«+ (v/c) cos#' 

v Vl~fP ~ 


(48) 

(49) 


cos O' -|- v/c 
~ I + (v/c) cos 0' ’ 


9'Vl~ 

1 -j- (v/c) cos O' ’ 


tan# — 


sin#Vl — 
cos # / + v/c 


(50) 


Equation (49) gives the exact expression for the optical Doppler 
effect; this agrees with the acoustic value to terms of second order. 
It is worth noticing, however, that for transverse observation ($ ~ irj 2) 
there exists a second-order effect here, while in acoustics v — v’ ex- 
actly. By (50), cos O' = — v/c for 6 *= n/2, and hence (49) gives 
v—v'y/l — fp; thus a shift toward the red takes place. Unfortu- 
nately, no experiment thus far attempted has been able even to ap- 
proach the accuracy needed for the observation of this effect. On the 
other hand, H. E. Ives was able to increase the accuracy of measure- 
ment of the longitudinal effect as far as second-order terms, by ob- 
serving in both directions along a beam of positive rays and measuring 
the displacement of the mean with respect to the frequency of the 
.soiree at rest. By (49), the mean value is 

*W/V(1 - /s*) = v'( 1 + ifP), 
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whiie from (24) and (25), v = *>'(! -f /J 2 ). 

The experiment decided in favour of the former value. 

The meaning of equations (60) is the following: If we neglect 
second-order terms, we see from the figure that the source is seen 
from S at the position it occupied when the wave just arriving at P 
left the source, i.c. it is seen where it was at a time r' jc earlier. This is 
the so-called time of retardation. In this time the source has moved 
a distance vr'jc, and from fig. 7 we have approximately 
cos 0 » [i ' co3 O' -j r'vjc] j { V ~j- (r'vjo) cos 0], 

Putting cos O' - -- cos 0 in the small second term of the denominator 
wc obtain (50). If the two systems remain , in uniform rectilinear 
motion relative to each other we do not notice this displacement, but 
since the earth reverses the direction of motion in its orbit every sis: 
months, the displacement of the apparent position of a star also reverses. 
This is the phenomenon of aberration; first observed by Bradley in 1728. 

The ordinary explanation of aberration is baaed on a ballistic analogy. If 
the ray of light were a projectile making a hole in the objective and in the eye- 
piece of a telesoope, the line joining the two holes would, on account of the motion 
of the telescope, deviate from the true direction by the angle computed above. 
Since aberration is a first-order effect (i.e. one depending on y/’c), it must follow 
correctly to this approximation from the older mechanics and electrical theory 
as well. 


S, Geometric Representation of the Loreaiz Transformation. The 
Fotir-dimensioaal World. Calculation with World-vectors, 

In order to compare the transformations of Lorentz and. Galileo 
we start with a very simple device — a graphical one-dimensional time- 




table (fig. 8a). We choose as abscissa not the time itself, but the pro- 
duct ct, which vs denote by m. As ordinate we select the distance x 
traversed on a line which we take as the cc-axis. Uniform motion will 
then be represented in this diagram by a straight line of slope /3 = vjc. 
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A light signal is thus represented by a line of slope unity. Now let all 
quantities be converted to a system moving with velocity v. If the 
Galilean transformation ( = t, x' ~ x — vt holds, this means that wo 
take » new ja'-axis of slope /3 in the diagram, the ar-axie remaining 
unchanged. If the unit on the old m-axis is giver* by the segment OM, 
then the unit on the m'-axis is given by OM', where If' is the inter* 
section of the m'-axis with a line through M parallel to the w-axis. The 
new co-ordinate system is oblique, but the a-axis is the same as before. 

The situation is different if the Lorenta transformatica holds in 
place of the Galilean. As we shall show immediately, the new axes 
and unit segments are obtained m follows: We draw (fig. 86) the 
hyperbolas 

m s — b 2 — I and m? — x % =■ — 1, . . (51) 

whose asymptotes m* — b® «= 0 correspond so the propagation of light 
signals. The n®w m'-axis is drawn as a line of slope as above. The 
onit po ; nt M'\ however, is now the intersection of this line with the 
hyperbola m a — *=> 1. Corresponding to the circumstance that the 

time must also be transformed in the relativity theory, a new V-axia 
must also be found. This is the line of gradient Ijfi, i.«. the lino 
symmetric to the m'-axis with respect to the fine of slope unity. The 
unis along the sc'-axis is determined by the point- of intersection X" of 
this axis with the hyperbola m* — a:* — —1. We proceed to show 
that these constructions make the transformation from the rectangular 
m, x system to the oblique m', x' system equivalent to the Lorentz 
transformation. First of all, the transformation to oblique co-ordinates 
with the same origin is given by a linear homogeneous transformation 
of the co-ordinates: 


m Am' -f- j.ix\ x vm' -j- px\ , . , (52) 

We determine the coefficients by means of the unit points. For M' 
we have m! = i, x' = 0, that is, m = A. Again, for the abscissa of the 
intersection of the line x — 8m with the hyperbola m 2 — a? 2 =* 1 we 
have \jVT~-- fP. Hence A= 1/Vl — /3 s . For the as-co-ordinate of 
the same point, x~ v™ j 8/VT— ■ /S 2 . In the same way, from 

the co-ordinates of the point X', 


r , O e C r ~S X } 

We thus have m ~~ or t — — 7 =£= I 

Vi - P Vi - p l 

x ■+■ fim' . x' -f- vt' 

Vi-P Vi- fi 2 i 


• • ( 63 ) 
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This representation provides an interesting partition of the 
gfrcontinuum. Since the maximum possible velocity for an agency 
connecting cause and effect is c, the straight line representing such an 
instrumentality can have, at most, a slope equal to unity.' Hence if 
in fig. 8b the point 0 represents the present, then in the shaded area to 
the tight of 0, we find all events which are causally conditioned by 0; 
this region therefore represents the fuiwre. All events which them- 
selves have conditioned 0 are found in the shaded area to the left of 
this point— the past. Events whose representative points (“•world- 
points ”) lie in the unshaded portions of the plane cannot stand in any 
causal relation with 0. These points, and only these points, may be 
brought into simultaneity with 0 by transforming to a moving system, 
since we can pass an af -axis through such points, thereby making 
-n' --- 0. If we include the other co-ordinates y ’and a of three-dimen- 
sional space, the asymptotes become the lateral surface of a four- 
dimensional cone x l -f y 1 -f- ^ — m? — 0; this does not change any 
essential item of our discussion. 

There is, however, another way of representing the situation which 
is much more important for the formal analysis, but which cannot be 
visualized as well as the above representation, since it employs com- 
plex numbers. If we introduce the co-ordinate l in place of m «= at, 
where l = id, 

we have & + f + » a + l 2 as ®' 2 + y'* + z' a + l' 8 . . . (54) 

This means that the square of the radius vector remains unchanged 
under a transformation of co-ordinates in the 
four -dimensional x, y, z, l apace. As in the 
three-dimansional case, the only transforma- 
tion, aside from reflection and translation, 
which leaves r* unaltered is a rotation of the 
co-ordinate system. Actually, it can be shown 
that a rotation of the x, l system of axes 
yields the Lorentz transformation. If the 
axis forms an angle $ with the 1-axis, 
then, according to the familiar relations of 
analytic geometry, 4 * 

V =s leos$-f ajsin^ ) 

&' as —l gm<p 4 * COS (f>) 

* This transformation, corresponding to rotation of » »Ua8 fcwo-cIfmetMioiid 
system of co-ordinates, may bo Verified readily by writing the wrpreasioa for the 
cadiaa vector in both systems; 

r - « -l- xi - VI' + xT. 

From fig, 9 we see that 

l — /' cos <j> - i' ma<j> and i •» l'sin$ 4 i‘ n »c4. 

Inserting these values in the equation for r and equating the eoaiScisnts of V and i‘ 
to stem separately, we obtain (&5). 
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If we set 
that is, 


tan 4> -- —ifi, ...... 

Rxn ^vT^p'- ' • 


(66) 

(57) 


we obtain 


x ~f *j8Z 

Vi=p 


I — ipx 

*vrrp 


Vi - p 


t? 

Vi~ 


pi j 


(58) 


i.e, the Lorentz transformation. 

Thus we have the important relationship discovered by Minkowski, 
that the transformation from one system of reference to another 
moving relative to the first with a velocity v corresponds to .a rotation 
of axes in the four-dimensional x, y, z, I world, the angle of rotation 
being given by equation (56). This makes it possible., by extending 
ordinary vector analysis to four dimensions, to derive generally valid 
laws' in the form of equations between four-dimensional vectors. In 
what, follows, these vectors will be called world-vector?., and will be 
denoted by bars beneath their symbols. Just as the ordinary vector 
equations representing the laws of Newtonian mechanics may be re- 
solved along any co-ordinate system, so the co-ordinate resolution ol 
world-vector equations yields a representation for a given system of 
reference. Resolution according to a system turned with respect to 
the first gives the representation for a reference system in morion 
relative to the first one. 

The following rules hold for calculations with world-vectors: Addi- 
tion and subtraction are defined aa she same operations performed on 
corresponding components. Graphical addition would, of course, be 
possible also in four dimensions, according to the methods of descriptive 
geometry. Since the four-dimensional co-ordinates are also orthogonal, 


a -- i, jj =. i kk ~ i, u =--- i, ij o, ik— o, 

il 0, jk -■= 0, &c. (69) 


Bence the scalar product of two world- vectors 

u a» u x i f u v j + ujz- •+■ u x l and v ~ v a i + %j -f v *& + *-V r 
is defined as 

uv u m v s -j- -}- ujo, -j- up* . . - ( 60 ) 


The definition of the vector product is somewhat more difficult, since 
this combination really signifies an antisymmetric tensor, and it is 
more or less fortuitous that such a tenser in three dimensions can be 
represented by a vector. In four-dimensional space an antisymmetric 
tensor has six components, a vector has four. We define as the vector 
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product [uv] of two world-vectors the antisymmetric world-tensor 
<3>, whose components are the determinants of the matrix 


€> x ,j — u x v y — u,jV m <& at = u a v B - 
== u„ v„ — «»«*, ® v j = U v V t - 


- «>v, f 


(61) 


Co'cresponding to the three-dimensional A-operator we have a four- 
diinensional operator 

. 0 0 0 


*5S + -^’ + *5E +/ 
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whose combination with world-vectors or scalars gives, in complete 
analogy with the three-dimensional case, the following differential 
operations: 

g i d»_ au =.'|H + y=+ 

_ 3« a 3«,, 

‘"B-a'—gj+iSj 

jll 2 

curl w = [□ £»J — |j 055 0$f 


In place of the equation div curl v - 
complicated property of world-tensors formed by taking the curl of 
world-vectors; viz. if 4> = curl v, i.e. 

. dv v dv x 

dVy - __ 0C; OVy 

' Tz'> 


S + *|S + /?£ . 

• (62) 

, dt\ , dv j 
+ 0Z+-0T * * 

• (63) 

0 d| 

f dz 01 1. . . . 

• (64) 



-- 0 we have a somewhat more 




dx 

_ 

" 3y 


dx 


dx 

. _ 
" di 


then we have 


0Z 

0d>„ t 

01 


d<P„ 

~0F~ 


0y 


= 0 1 


dz 


§*sl , ?£» . ^SS 

a® r a« T 0i 


ay 01 ' 


(66) 
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9. Newton’s Second Law in the Theory of Eelativity. The Variability of 
Mass and the Inertia of Energy. 

The Galilean transformation forms the basis of Newtonian me- 
chanics. It is oyr task to modify the laws of mechanics in such a way 
that they become compatible with the experimentally confirmed 
Lorentz transformation. It is true that the velocities which are met 
with in the mechanics of macroscopic bodies are so small that proof 
of the correctness of the transformation by means of terrestrial ex- 
periments has so far not been found possible; but we can impart to 
electrically charged particles (electrons) velocities approaching c, 
which therefore permit of a test. The formulation of the relativistic 
basis offeneehanics is usually approached from electrodynamics, but 
it is also possible to derive the new laws from purely mechanical con- 
siderations. Adopting the latter method of approach, we have to 



Fig. .o 


introduce the concepts of quantity of motion (momentum) and of 
energy into the mechanics of the Lorentz transformation, and de- 
termine the corresponding mathematical expressions in such a way 
that the laws of conservation of momentum aud of energy are re- 
tained. 

The simplest process whose result is conditioned by these laws of 
conservation is that of the impact of two equal spheres. Following 
Tobnan, we use this case for a hypothetical experiment. Imagine an 
observer located on the y ~ axis of a frame or system E and another 
observer on the y'-axis of a second system S'. Let the first observer 
throw a ball along the y-nxis, the second one along the negative y'-axia 
(fig. 10). The masses of the two spheres are to be equal, when measured 
in the same system. Assume the two velocities of projection, as 
measured each by the corresponding observer, to be equal and oppo- 
site. Let a collision take place as a result of the motion of S', and let 
the instants at which the spheres are thrown be so chosen that the line 
of centres at the instant of impact has the direction of t-he y- axis. 
SinceHhe spheres are assumed to be smooth, they cannot exert tan- 
gential forces on one another, and so the ^-components of the veloci- 
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ties axe unchanged by the impact. If we denote the ball thrown by 
observer A in system S by a, the other by b, then, according to the 
role for the addition of velocities, the velocity components before 
impact are 


as seen from IS: as seen from S': 

W a0 — 0 tC'aa = ~ V. 1 



Let all quantities after impact be denoted by bars over the symbols 
In order to be able to apply the law of the conservation of momentum 
we make the assumption that, in relativistic mechanics also, the mo- 
mentum vector has the direction of the velocity vector. Here, how- 
ever, we replace the scalar factor which represents the constant mass 
of the sphere in Newtonian mechanics by a variable multiplier a. 
Being a scalar, this can depend only upon a scalar quantity, say the 
magnitude w of the velocity. Under the above hypotheses, the s> 
component, in the system 2 yields 

-«<«>■ . (67) 

or **=±w. . (68) 

Newtonian mechanics requires that m~ — u in this case. Since 
relativistic mechanics must reduce to the Newtonian fom for smn.Il 
velocities, we must use the negative sign hero. Further, the y-com- 
ponent yields the equation. 

.(«)■»-. ( > /e + g(i-gj).« A /i-g ^ 

- a(u) . u - a(^Ji? + »» (l - 5)) ' “ V 1 “ $ < 69 > 

This is satisfied for u -- — u if 


(?0) 



'II we now left the velocity of projection become smaller and smaller, 


«(»)=* 


7 


( 0 ) 

=•' 7* 


(71) 


For very small velocities, however, we must arrive at the Newtonian 
expressions, in which the scalar factor a is the mass m 0 . II we call 
this scalar factor the mass in the relativity theory also, then the result 
Steves that the mass increases with increasing velocity. Denoting 
the mass corresponding to a state of rest by m 0 , we obtain the funda- 
mental relation 



In what follows, the quantity m will be mors accurately termed the 
mmmntal mass or the apparent mass. 

We new formulate the fundamental law of mechanics in the original 
Newtonian form, in which the force is set equal to the time rata of 
change of momentum: 


s(«»®) = 



(73) 


This differs from the old iprm in that m may no longer be placed 
before the sign of differentiation, since it is now variable. 

According to this equation, the inertial mass, he. the resistance 
offered by a body to acceleration, is not now identical with the ap- 
parent mass; in fact the inertial mass in a frame of reference in which 
the body has a speed v will actually have different values in the two 
cases when the acceleration takes place in the direction of the initial 
velocity* and normal thereto. To show this, wa take the x-nxh of 
our co-ordinate system in the direction of the velocity, while the 
acceleration is taken to be in the ay-plane, i.e. 


dx 

Jt 


d?x 

<P : 


&y _ , dh , 
(P “ *'*" e& ' 


Carrying out the differentiation in equation (7S) 'and putting v# — v, 
ws have 
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Thus the inertial mass occurring as the factor of proportionality in 
the equation connecting force and acceleration has the magnitude 




for the aHSCinponent. and the magnitude 


for the ^-component. This means that the fores vector and the 
acceleration vector have different directions; the, acceleration makes 
a greater angle with the velocity than the force does, since the longi- 
tudinal inertial mass is greater than the transverse inertia! mass. 

The formulation of the Second Law given by equation (73) pre- 
supposes a particular frame of reference. For a frame in motion rela- 
tive to this frame, the quantities entering in (73) must be transformed 
according to the Loient-z transformation. This is unsatisfactory from 
the standpoint of the relativity theory. A generally valid relationship 
ought to have the form of an equation connecting world-vectors, 
which may be resolved (i.e. split up into components) in any four- 
dimensional co-ordinate system, so that the relationship remains valid 
for any given frame, or for any frame in motion relative to it. We 
now attempt to formulate such relationships. First, we note that at 
any g>ven instant one special frame is marked out from all. others, 
viz. thar which has the same instantaneous velocity v as the particle. 
Wo shall call this the rest-frame. The time in this frame we call the 
local time or proper time r. Hence for a frame in which the velocity of 
the particle is v, and the time is t, we have, by equation (38) (p. 243), 

dt 1 


Take now the four-dimensional radius vector by which the position 
and time of the particle are specified: 

r = xi +• yj + zk -f I/, 

where, as at (54), p. 249, i — ict. By differentiating with respect to 
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tike proper time, we obtain a new world- vector q which we call the 
fmr-iHdccity. For a'particle at rest, this vector has the value g= iel, 
on account of the vanishing of the derivatives dzjdr, dyjdr and dzjdr. 
The square of the magnitude of the vector q which is independent of 
the so ordinate system, is — o®, a constant for all motions. In another 
door-dimensional system of co-ordinates with time t, in which the 
particle has the velocity V, 


The spatial components of the world-vector are 
system by the components of the ordinary vector 


nd the time component (J-comooneafc) is given by 


Hans® in the above form of the Second Law, in (73), p. 264, we can 
put the spatial components of q, combined into a vector q„ in place 


* *“ | (»**V) “ V 1 - | K<Tr). ' - - Cf8) 

We now introduce in place of the force F a world-vector P — the st four- 
force ” — whoa® spatial components are identical with those of F in 
the sest-fcrama, but which satisfy the relationship 


in another reference frame. Then the Second Law has the familiar 

form 

£“a; <"•*>• •••••• C 30 ) 


This form leads to a further important result. 
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If, in analogy with the method of proving the theorem of the 
conservation of energy in classical mechanics, we taka the scalar pro- 
duct of bosh members of (80) with g, we obtain 

?£ « Tr (”W)g - i (*hg*) - 0- • • • (81) 


The right side vanishes, since the magnitude of q is the constant ic. 
If she left member be written in components, the spatial components 
being combined into ordinary vectors, we have 

P r g r -t-P,g,= 0, 


(vr: 


i)fc) 



The product Wv is the work done per unit time. If ws replace P } 
on the left side by the value 



obtained from equation (80), we obtain 


~Vi- 


dN 

s w “y* 


The quantity N introduced here has the value 





i®(i Is 


to which a constant of integration must still be added. If wo give 
this constant the value — «t 0 c 2 , we see that the development of the 
radical is a first approximation to the kinetic energy T of Newtonian 
mechanics. Thus the conservation of energy retains its usual form in 
relativity: the work done per second is equal to the increase in kinetic 
energy per unit time. 

What is the meaning of the quantity *%e 2 , which has the dimen- 
sions of energy? If we take the constant of integration to be zero, the 
term m^c 2 represents the energy of the particle at rest. This energy 
has an enormous value. It is natural to interpret this so mean that 
inertial mass represents a large accumulation of energy and that, 

1<? (STU) 
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rec’piocally, to every amount of energy E there corresponds an in- 
ertial mass p 

m 0 = ~ ...... ( 88 ) 

This is the Principle of the Inertia of Energy* 

According to this law, the inertial mass of a body is increased 
whenever energy is supplied (e.g. by heating). The energy available 
in the laboratory, however, is far too small to make 'obis increase of 
mass observable in any way, but it is otherwise with the enormous 
energy changes involved in nuclear' processes (see Chop. XLII). The 
loss in mass of the stars, resulting from radiation of energy, is also 
considerable (of. Ex. 67), 

Ex. 67. What is the annual loss in mass of the sun, if approximately 2 calories 
of radiated energy are received by each square centimetre of the earth’s surface 
per minute? (The distance from the earth to the sun is about 16 0 X JO* km.) 

10. Fundamental Concepts of the Generalized Theory of Relativity. 

The Lorentz transformation refers only to frames of reference 
which are in uniform rectilinear motion with respect to each other. 
We have seen that the assumption of an absolute space is superfluous 
and in contradiction with experience for this type of motion. But the 
situation is very much more difficult for frames in accelerated motion, 
e.g. in rotation. The phenomena accompanying rotation do seem to 
point ta the existence of an absolute space, for which alone Newton’s 
Second Law is valid; for the occurrence of inertial forces, such as 
centrifugal or Coriolis forces, indicates that the bodies aro in accelerated 
motion with respect to such a space. But is it not more advisable to 
say that the accelerated motion takes place with respect to the re- 
maining masses of the universe? If these masses, and not a cosmio 
ether, are the cause of centrifugal forces, then such forces would have 
to vanish if the remaining material of the cosmos were removed. But. 
unfortunately, this is an experiment which one cannot perform. Never- 
theless, as early as 1896, B. and J. Friedlander undertook experi- 
ments to detect the existence of feeble centrifugal forces on stationary 
bodies placed near large, rapidly rotating masses, the latter being 
intended to take the place, in a certain measure, of the masses of the 

* The history of this far-reaching law is, briefly, as follows: As early as 1887, Voigt 
took up thd discussion of the time transformation in a little-noticed paper, and in 
1892 Loren tz revived the question in connexion with the optics of moving media. In 
1900 Larmor wrote the Lorentz transformation in the form we know it to-day, and 
four years later Lorentz published a paper in which he treated the optics of moving 
media entirely from the standpoint of the Lorentz transformation. In 1906 Poincare 
and Einstein independently arrived at substantially the conclusions given in § 7. 
Einstein’s paper contained the mass-energy relation. Shortly before this, HasenShrl, 
in Austria, treated the special case of a radiation-filled enclosure, showing that the 
contained radiation would oppose any acceleration of the container as though the 
radiation had a virtual mass given by (83). 
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universe. On account of tiie disparity of the masses in question, the 
experiment would be doomed to failure even now. Even so, we may 
claim that a theory which explains inertial forces through the action 
of matter has the advantage of not introducing a hypothetical space 
(cosmic ether), whose existence has not been verified in the realm of 
uniform motion. But the path which leads to a strict theory based 
upon these ideas is long and arduous. In the first place, we must give 
up something which we are naturally inclined to take for granted: 
the validity of Euclidean geometry in our foux-dimeusional space-time 
continuum. This geometry, which was fundamental in the special 
theory of relativity, appeared in the expression for the square of the 
radius vector £ 2 — sc 3 4- ij 1 -f- (Theorem of Pythagoras). 

The idea that the physical s -pace in which we live might not have 
the properties of Euclidean geometry is not new. Biemann expounded 
the view that the metric of space is a physical property, and not a 
matter of intuition. As long as we measure small triangles on the sur- 
face of the earth with ordinary rough apparatus, we find that all the 
propositions of Euclidean geometry— in particular that concerning 
the sum of the angles of a triangle — are valid. However, if we deal 
with larger areas, the straight lines are replaced by chains stretched 
over the earth’s surface, and more exact measurement reveals that the 
sum of the angles of a triangle is greater than it. In exactly the same 
way, a triangle in space, whose sides are formed, say, by light rays, 
might have an angle sum difierent from rr; in fact, Gauss tried in this 
way, though without success, to find such a departure from the 
Euclidean metric. 

Prom the epistemological standpoint, the idea here advanced 
means nothing less than a reintroduotion of the ether, since we are 
really once more ascribing physical properties to space. But we can 
dispense with this physical space by using the formula: “ It is mere 
assumption to regard space-time as Euclidean. By suitable hypotheses 
with regard to the metric of the space-time frame, a more perspicuous 
representation of the world may be obtained.” 

Einstein’s fundamental idea in the generalized theory of rela- 
tivity was to determine the metric of the space-time continuum by 
means of the distribution and velocity of matter in such a way that 
in a frame of reference relative to which the matter is in accelerated 
motion the inertial forces result automatically. In this connexion, 
use is made of the equivalence of gravitational fields and accelerations 
mentioned on p. 231-. As explained there, this equivalence is based 
on the constant ratio (=1), of inertial mass to gravitational mass for 
all bodies. This equality is by no means a matter of course. Never- 
theless, it forms the basis of the generalized theory of relativity, just 
as the negative result of the experiment to detect an ethqr-wind forms 
the basis of the special theory. In the generalized relativity theory the 
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fundamental law of mechanics is; A particle (or light ray) upon which 
no electromagnetic forces act describes a line of stationary length 
(geodesic, or generalised straight line) in the space-time continuum, 
the metric of ths continuum being determined by the distribution and 
velocities of the material bodies. This connexion is mathematically 
very complicated and cannot be expounded in detail here; the ex- 
pression for a geodesic in the non- Euclidean metric is also compli- 
cated. The new feature is that gravitation, which is not included in 
electromagnetic ©Sects, appears in this manner as a property of space, 
and that the earns centrifugal forces would arise whether the earth is 
considered to turn on its axis or the rest of the universe to be rotating 
about this axis. Hence the conflict between the points of view of 
Ptolemy and Copernicus vanish?®.. 

There are three cases where the generalized theory of relativity 
leads to small deviations from the results of the old physics. The 
calculations yield the result that the perihelia of the planetary orbits 
advance by very small amounts. For the planet Mercury, where the 
effect is greatest, tha amount of this advance is computed to he only 
43" per century. Again, on account of the change in the metric in the 
neighbourhood of heavy bodies, a light ray passing near the sun on 
its way to us from a distant star will suffer & smell deviation, so that 
the position of s star whose line 'of right grazes the sun will be some- 
what displaced. This effect can be looked for at the time of a total 
solar eclipse. Finally, the change m the time metric caused by gravi- 
tation causes spectral lines originating in a strong gravitational field' 
(a.g. that of the sun) to be shifted vary riightly toward the red, when 
compared by an outside observer with lines generated in his own 
system. Ail three effects exist. However, it is a matter of extreme 
difficulty to allow correctly for ah disturbing influences, so that we 
are not yet justified in speaking of a quantitative verification of the 
generalized theory of relativity. 

Since the idea of determining the metric of the space-time con- 
tinuum in such a way that the laws of nature are automatically con- 
tained therein has been successful in the case of gravitation, it was 
natural to attempt to extend the notion to electromagnetic phe- 
nomena also. Einstein, by means of a still more general geometry, 
actually was able to devise & unified field theory which includes elec- 
tromagnetism. Nevertheless, it must be recognized that such a corre- , 
iation embraces only one aspect of the universe; it fails to explain 
atomic structure — the electrons, protons and quanta. 

With the large particle speeds now available in the laboratory, the 
special theory of relativity has become indispensable in atomic and 
nuclear research. However, up to the present time at least, the general 
theory has be^r. applied only to cosmological physics, and more specific 
connexions with experiment are still lacking. 



PAST III 

Field Theory of Electromagnetic and Optical Phenomena 

Two disfexent methods are employed in the theoretical treatment of electrical 
and magnetic phenomena. The first method starts -with maeroscopioally measur- 
able quantities and describes mathematically the relationships between such 
quantities found by experiment. An enormous range of phenomena may be 
brought within the scope of this method in the form of a system of differential 
equations. The integration of these equations for particular oases then permits 
us to answer, by rigorous calculation, a large number of questions, including 
many of technical importance. This method employs only experimental laws 
and their mathematical consequences; it is therefore entirely free of hypothesis, 
and cannot come into conflict with experience. Since it makes no assumptions 
whatsoever concerning the structure of electrical charges or of matter, we call 
this the continuum theory, or the field theory. Nevertheless, it was soon recog- 
nised that a large number of very striking phenomena, e.g. those of electrolysis, 
are entirely left out when this point of view is taken. In order to explain these 
phenomena, we must make assumptions concerning the structure of matter and 
of electrical chargee. We cannot verify these assumptions by direot observation, 
but only by testing their consequences. This second method will be called the 
atomistic method. While the continuum theory will be treated first, we shall 
rje-toxrtbelaas not attempt to exclude all reference to atomistic concepts; on the 
contrary, v?e shall occasionally avail ourselves of a side-glance at the atomistic 
picture of the phenomena for greater vividness. 
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The Electrostatic Field in a Vacuum (or in Air) 

1. Definitions. 

(a) Electrical field strength and charge 

In order to arrive at a definition of electrical field strength we make 
use of an arrangement whose real purpose we shall not learn until 
later (p. 425). Two horizontal metal plates (fig. 1) are placed opposite 
each other at a distance of a few centi- 
metres and are held in this position by 

amber supports. By means of an atomizer, &— 

a cloud of very fine droplets of oil is + 

blown into the space between the plates. 

The drops are illuminated from the side Fig. i 

and may be observed in a telescope. 

We observe that, under the combined effect of gravity and air 
resistance (cf. Stokes’s theory, p. 217), the drops fall with uniform 
velocity. We now touch the upper plate with a glass rod which has 
been rubbed with silk, the lower plate being connected to the earth 
by a wire. It is observed that the motion of the droplets changes: 
some rise, some fall with various speeds,' and others remain suspended 
m space. If we concentrate attention on a single drop, we observe 
that it chruges its velocity now and then, especially if some radium 
b placed near by. The direction of motion, however, is always normal 
to the plates. We conclude from this observation that a new force, 
whose cause is connected with the touching of the upper plate by the 
glass rod, acts on the droplets. The space between the plates has thus 
become the seat of a field of force which is superimposed upon the forces 
already present. We find further that the magnitude of the force and 
its algebraic sign depend on the state of the droplet. Hence we set 
the force vector F equal to the product of a vector E, independent of 
the state of the drop, and a number e which we call the charge * on 
the drop: 

F = eE. (1) 

S is called the electrical field strength. By observing the changes in 

® The dropiec becomes charged when it is formed by atomization of the oil. 
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the operating force, as revealed by changes in the velocity of rise or 
fall of the drop, wo conclude that the droplets alter their charges 
under the influence of the radium radiations. 

The algebraic sign of the charge is, conventionally determined in 
such a way that a particle repelled by the upper plate — the one touched 
by the glass rod — is said to cany a positive charge. In the light, of 
our present, knowledge, this convention is nos a happy one. since the 
most important charged particle, the election, is thus given the nega- 
tive sign. 

(6) Lilies tif force and fiux 

Suppose that we have determined the field strength E at every 
point of space by means of a teat body whose charge we arbitrarily 
take to be unity. We can then specify the field strength at every point 
by a vector. However, such a representation does not afford a very 
good general view of the whole field. It is better to draw curves whose 
tangents give the direction of the field at any point. Then the strength 
of the field must be specified in some other way. We still have a choice 
as to how close together we shall draw these curves, the so-called 
Unes of force. We adopt a method which gives the density of the lines 
of force as a measure of the field strength. The lines of force of the 
electrostatic field form an orthotomic system, i.e. they are the ortho- 
gonal trajectories of a family of surfaces. This is not true of all vector 
fields. At every point of space there is a certain plane, which is tangen- 
tial to the orthogonal surface at that point. We arrange matters so 
that the number of lines of force passing through each unit area of the 
orthogonal surface is proportional to the field strength. The number 
of lines of force passing through an arbitrary area is called the jinx 
or induction through that area. As may be seen from fig. 18 (p. 23), 
the flux through an arbitrarily oriented element of area dS amounts to 


d<X> = EdS. . (2) 

Hence the flux through any finite surface 8 is 

® = j EdS. ....... (3) 


2. Electrical Charge (Quantity of Electricity) as the Source of Flux. 

If we take the integral <£ EdS over a closed surface 3, this gives 
the. excess of the number of lines of force leaving the volume bounded 
by 8 over the number entering. If, in the experimental arrangement 
of | 1, we take the surface 8 so that, it does not intersect the plates, 
we find that the integral vanishes: * that is, no lines of force originate 

* With the proviso, oi course, that there are now no charged particle* between the 
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in the space between the plates. The situation is different if our sur- 
face encloses, say, the upper plate. The surface integral now has a 
definite value, and we can connect the non-vanishing of this integral 
with the fact that the plate was touched by the rubbed glass rod. 
We naturally conjecture that we have thus given the plate an electric 
charge, and can verify this directly by bringing the plate into a similar 
system of larger dimensions; we shall then recognise the presence of 
a charge by the resulting forces, just as in the case of the oil drops. We 
take the value of the integral as a measure of the charge within the 
enclosed region, and set 

eoc^EdS, . (4) 

the units being still undetermined. The electrical charge or quantity, 
of electricity thus plays a double rdle: an active part as the generator 
of a field, and a passive part as the object upon which the field acts. 
The density of charge at a point P of space is the limit of the ratio of 
the charge within a volume element containing P to the volume of 
this element: 

. Ae r RdS 

p — lim r- oc —i — — div E (5) 

r Ar At x ' 

That is, at every point of space the divergence of the electrical fielc^ 
strength is proportional to the density of charge at that point. The 
limit of AA/At is not always finite; in nature we find surface distri- 
butions of charge as well as volume distributions. If we construct a 
cylinder whose base is AS, and whose generators are normal to a charged 
surface., we can make the altitude of the cylinder small of a higher 
order than the base without changing the amount of electricity en- 
closed by it. In this instance we define the surface density of 
charge a to be the limit of AejAS, i.e. the amount of electricity 
on unit area of the surface. Let us take the surface integral 
of the electrical field strength over such a small cylinder (cf. 
p. 43, fig. 27). We distinguish the two sides of the surface by 
the subscripts 1 and 2, and suppose the normal to be directed 
from 1 towards 2. Then, since the lateral surfaces are small of a 
higher order than AS, we have 

AS(«E 2 — «E X ) oc Ae (0) 

If the limit of AejAS is not zero, mE 2 — «E t must be different from 
zero. This means that the normal component of the field strength, 
nE, has different values on the two sides of a charged surface, 
i.e. there is a discontinuity in the field strength. The expression 
m(E 2 ~ Ej) is called the surface divergence of E (written Div E), 
as on p. 44. We may thus say that for a surface distribution of 
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dmge, the surface density is proportional to ike surface dfa&g&Mx of E. 

Div 1 oc c, ....... (7) 

To bring the qoantitie? e, E into connexion with the absolute 
system of unite, we start with the field of a single point charge e. From 
symmetry we see that the lines of force are radial and of equal density 
in all directions. The orthogonal surfaces in this case are spheres with 
centra at s. The induction though a sphere of radius r amounts to 

<£ = <fi Ed& 

^Sph cn> 

Since the density of the lines of force is the same everywhere, I E | is 
constant over the surface of the sphere and 

fy — 'j) Ids 4 wj* j E | «* ye, i.e. } B \ = 

-where y is a factor of proportionality, The force on a second charge 
s' becomes 

»l-& <»> 

This is Cm&vmb's Law* 

The unite are determined by specifying the factor of proportionality 
y. If we place y -- 4*r, we obtain e in the absolute electrostatic system. 
In addition, however, there are three other systems of unit* in use. 
To avoid confusion it is desirable to use only two systems. One ia the 
0,G.S„, introduced above, in winch, all electrical quantities are measured 
in. slecsrossstie unite (e.s.u.) and all magnetic quantities in electro- 
magnetic unite (ejji.ii.), to be introduced later. The other is the 
cations-ILed imftfrMlogran^setmd (M.S.S.) system, sometimes called 
the Oicrgi system after one of its* fi-st proponents. The fundamental 
mechanical and electrical units in this system are the metre, kilogram, 
second, ampere and volt, the size of the electrical units being given 
by their legal, definitions in terms of the amount of silver deposited 
electro! yiicftily and the electromotive force of a standard cell, respec- 
tively. 'The M.K..S. unit of force is called, one newton, and jh equal to 
10 6 dynes. The C.G.S. system is still indispensable in atomic physios, 
for all numerical constants are specified ia terms of these units; but 

* If we examine the familiar diagram giving tf»e Ham of foroa in the neighbourhood 
of We- equal sad opposite point cnim, we see that ail too line* of fores originating 
at the poritiv© charge converge into the negative, Hone* the field is cu longer sphed- 
offiEy sysxuoetrjosl, ee. it wsa before we introduced the second charge, yei we used this 
properly of the field In o&Sod&ting the fore® between two charges. The oor.tradiotion 
disappears if, in calculating the force ca a gives charge, w® draw only the line® of 
force ©manatfag frovi (or converging on) the other charge. The field pattern mentioned 
above gives the forces acting open a third charge introduced into the field. 
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the M.K.S. system has such great advantages in macroscopic problems 
that its use is becoming more and more widespread. In using M.K.S. 
units, however, it must be remembered that even quantities such as the 
number of molecules per mol differ from the familiar C.G.S. values, for 
in the M.K.S. system the kilogram mol is used in place of the gram mol. 

For comparison, the more important electrical formulas will be 
given here in both systems. The M.K.S. unit of quantity of electricity 
is the ampere-second (coulomb), and that of field strength is the volt/ 
metre. Thus the relation between charge and field strength as given 
by (4) contains a dimensional factor of proportionality, called the 
permittivity (or capacitivity) * of free space: 



Zo 

= 8-859 

X 10- 12 ; 

amp. sec./metre volt. 


According 

to (8), 

we then 

have, in 




C.G.S. 

units 


M.K.S. unite 



<y- 

4ir) 


(y - I/E») 


II 

* 

H 

~e-> 

47T«. 


M 

s — K 0 E dH. . , 

. (46) 

div E = 

4irp. 


(oa) j 

- p = K 0 div E. . . 

. (56) 

DivE = 

4-71-0. 


(la) 

a — K 0 Div E, , , 

. (76) 

F = 

"f * ' 


(8a) 

p _ 

4eirK 0 r r * • 

• m 


3. The Electrostatic Potential. 

The work done by the field when the charge e experiences a dis- 
placement ds is 

dW — Fds == eE ds (9) 

If e is moved a finite distance along a curve from P 0 to P lt the total 
work done 13 

W = P'd&is = « f P 'Eds. .... (10) 

p. j p , 

This work is found to he independent of the path C over which the charge 
moves from P 0 to P t . On account of this fact, the integral (h Eds 
taken ever % closed path is zero, i.e. 

fEds = 0. (10') . 

This means that the field strength E is irrotanonal. Now we know 
from vector analysis that every irrotational vector may be repre- 
* The terms dielectric constant and specific inductive capacity refer to material 
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rented as the gradient of a scalar. This means a great simplification: 
instead of giving the numerical values of three components at each 
point, it is sufficient to give the numerical value of one scalar at every 
point. Then by taking the gradient, i.e. by pure differential operations, 
the vector components may be found. The field considered above is 
of this kind, for in this field 

dW — grad W ds, 
and by equation (9), for e ~ 1, 

dW — E ds, i.e. E == grad W. 

However, it is more convenient to work with the quantity (— TF), in 
analogy with potential energy in mechanics. This would be the work 
done against the field when unit charge moves. This quantity is called 
the electrostatic potential V.* It is to be noted that the product eV, 
not F itself, has the dimensions of work. In terms of this concept we 

E = -grad F; f'tids = F 0 — V v . . . (11) 

Taking the divergence of the field strength, we have by (5a) 

div grad F — AF — — irrp, or 6T„AF — p. . (12) 


This differential equation for the electrostatic potential is known as 
Poisson’s Equation. In particular, at all points of a region devoid of 

C ’” 8e ’ &F-0, (13) 

which is Laplace’s Equation. 

On account of the importance of the equations of Poisson and La- 
place, we give the expressions for AF in the commonest co-ordinate 
systems (of. Ex. 22, p. 43): 

In rectangular co-ordinates: 


AF = 


8*f , 


8a* 

In cylindrical co-ordinates: 

AF 


dy* n 3s»* 


In spherical co-ordinates: 
AF 


+ pdp 1 


i!!T , !L F 

p 2 8^ + 8#* 


r 2 sin 2 # d4> 3 ' r* 8d 2_r r 2 


(K) 


. (15) 


(16) 


* More accurately, it is the difference in potential between the end points P e and 
Pj of the path, Of. aquation (11). 
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Ex. 68. Is it possible to produce a uni-directional electrostatic field in 
which the magnitude of the field strength increases in a direction normal to E ? 

Ex. 69. At the surface of the earth the electrostatic field in the atmosphere 
is 100 volts/m., and at a height of 1500 m. the field strength is 26 vclts/m., the 
field being directed toward the centre of the earth. What is the surface charge 
of the earth and what is the mean space charge in the atmosphere between 0 
and 1-500 m. altitude T 

Ex. 70. In a gas discharge between parallel plates, the potential is measured 
as a function of the distance from the cathode, using probes. Explain how the 
field strength and the distribution of space charge oan be found from these results. 


4. Simple Examples of the Electrostatic Field in a Vacuum (or in Air). 

The problem of calculating an electrostatic field may be considered solved 
when the distribution of the potential has been found, for the field is determined 
from the potential by taking the gradient, i.e. by simple differentiation. The 
problem is to find solutions of the differential equation for the potential which 
are in agreement with the particular conditions of the problem in hand. This 
problem may be solved at once if tbfe distribution of charges in the entire space 
is given. But this is generally not the case. We must take into consideration 
the fact that there are substances — so-called conductors of electricity — within 
which no electrostatic field can be maintained, but upon which the charges may 
move freely. Every surface of a conductor must therefore be a level surface of 
the field, and we must find a solution of AF --<=> 0 which satisfies', the boundary 
conditions imposed by the positions of the conductors. It is only after this has 
been done that we oan determine the distribution of charge by evaluating 
Divgrad V, Under these conditions the problem is considerably more difficult 


A. No Conductors in the Field. 


(a) Potential due to ■ point charges 

We begin with the simplest instance of a given distribution of 
charge, viz. the simple point charge. Its field is spherically symmetrical, 
and has already been calculated in § 2 (p. 266). The corresponding 
potential might be set down at once, but we wish to proceed in a 
somewhat more general way. By equation (16) (p. 268), the equation 
for the potential in a space devoid of charge (Laplace’s Equation) is, 
for spherical symmetry. 


The integral is 


<PV 2 dV 
~ dr i + r dr 

. . . (17) 

V=S + B. . . . 

. . . (18) 


B is the value of the potential at infinity. We normalize the potential 
in such a way that it vanishes at infinity. This can always be done, 
since any system of charges in the finite region appears to be con- 
centrated at a single point when viewed from infinity. Hence we may 
specify, as a boundary condition of the differential equation, that V 
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vanishes at least to the order 1 jr at infinity. The constant A is de- 
termined by taking the surface integral over a sphere with centre at 
the point charge. The value thus found by equation (4a) (p. 267) is s, 
while (4 b) yields e/4ffi£ 0 . 

If several point charges are given, their fields are superimposed 
without interference, and so the combined potential is the sum of the 
potentials due to the separate charges: 

F = E or (in M.K.S. units) S . . (18a) 

r t AnK 0 r t 

For a continuous distribution of charge in space or upon surfaces the 
sum becomes 

This equation expressing the independent superposition of the fields 
may be derived directly from the differential equation with the help 
of Green’s theorem. We give the procedure for the part due to a space 
distribution of charge. Let ¥ and ijj be two scalar point functions 
which are continuous, together with their first and second spatial 
derivatives, in a given region. By applying Gauss’s theorem to the 
vectors F grad ift and ijs grad F in turn, and subtracting, we have * 

ij) (F grad (tfi grad Y)dS = J VAifsdr — J ifiAV dr, (20) 

This is Green’s theorem. 

Let F satisfy the differential equation AF = — 4?rp. Let us try 
the effect of taking the function 1/r as the 
value of </>, where r is the distance of -the 
element of charge pdr from the point P 0 at 
which we wish to evaluate V (cf. fig. 2). Then 
thiB function will be discontinuous at P 0 , and 
we must remoye P 0 from the region of in- 
tegration. This is accomplished by drawing a 
small sphere of radius a with P 0 as centre. 
The surface of this sphere is a part of the 
bounding surface of the region of integration. 
According to the convention adopted in the 

* Gauss's theorem (p, 24} is (j) v dS — J div v dr. 

Here v — V grad &. By equation (88) (p. 38), 

div ( V grad i/r) - (grad <j>) (grad V) + Y div grad ifi 
- (grad iji) ( grad F) + FAi/r. 

Similarly, for the function $ grad V, 

div (i/r grad F) «* (grad tfi) ( grad F) + i/AF. 

Substituting in Gauss’s theorem and subtracting, we obtain equation (20). 
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proof given of Gauss’s theorem, the normals to this surface point out- 
ward from the region of integration, i.e. toward the centre of the sphere. 
The outer boundary is taken to be a sphere of infinite radius. The in- 
tegral over this infinite sphere vanishes : the surface element r 3 sin dd6d<j> 
increases as r 2 : but V varies as 1/r, and so grad F varies as 1 jr 2 , hence 
the products F grad (A and 0 grad F vary as 1/r 3 , and so the surface 
integrals vanish for r = oo. Thus there remains the integral over the 
small sphere of radius a , upon which F has the value F 0 , the value at 
P 0 , in the limit. Denoting by n the unit vector directed toward the 
centre of the sphere, we have 

= jT £ (V grad l - ± grad F j no 2 sin 6ddd<j> 

= 4 5rc a ~°, (21) 

since both F and grad V are finite at P 0 , and hence the contribution 
from the second term vanishes for a ~ 0. Then by equation (20) we 

have, since AF = —4 np and A - = 0, " 

< 22 > 

In a similar manner we obtain the contribution of the surface charges 
by enclosing the surfaces of discontinuity in closely fitting surfaces 
which separate them from the remainder of the region. 


(h) Potential due to a ephere of radius a uniformly filled with 
space charge of density p 

We must distinguish here between the external space where the 
equation AF — 0 holds, and the interior, where AF = — 4 rtp. For 
the exterior, the spherically symmetrical solution of AF — 0, viz. 

w A 


is valid. Using <j> 1 dS — Ane, A is found to be \na z p, so that 

m 


For an external point , the effect of the sphere is the same as though the 
entire charge were concentrated at the centre. This solution must still 
hold for r = a: 
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In. seeking a solution for the interior, which again can depend only 
upon r, it must be remembered that the additive constant in the 
potential, has already been determined by the solution for the exterior; 
the two solutions must fit at r — a, since a discontinuity in V would 
mean infinitely great field strength. The field strength in the interior 
is again determined by the flux of force, by equation (4o), p. 267, to be 


E ; 


****P r 
3r 2 


(24) 


The potential is obtained with the proper constant if we start with 
Aee^tion _ F> _ ^ _ r-^, 

J r<a J '*» 8 

By (23'), this gives 

y 4 ira?p , 2*ra 4 p _ 

’ 3' + ~ 3 3 

It may be noticed that the part of V, wliich depends on t has the 
same form, except for sign, as the potential of the force occurring in 
Bimple harmonic motion (Ex. 35, p. 88). If we assume a negative 
charge at P, then the algebraic sign is also the same, and this charge 
will execute harmonic vibrations in the interior of the sphere. 


2wp («*-£). . (25) ‘ 


(c) Potential due to a dipole 

A dipole or doublet is a system consisting of a negative charge 
— e and an equal positive charge +« separated by the elementary 
segment ds. The product eds is assumed to have a finite value wt 
called the moment or strength of the dipole, i.o. if the distance apart 
ds is made smaller, the charge e is to increase in such way that the 
product eds remains finite. As follows from its definition, the dipole 
moment is a vector. If we denote the distance of any point P from 
the negative charge by r_ and that from the positive charge by r + 
we have 

y — (26) 


The change in the point function 1/V is obtained by taking the gradient. 

It must be noted, however, that in 
0 p taking the gradient the “ field point ” 
;_ ds P is moved, while the equipotential 
t surfaces remain fixed. Thus we are not 
permitted to move the charge. Writing 
simply r for r_ in the difference, we 
obtain the following expression by moving the field point (cf. fig. 3): 
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_ edsr __ mr 
- 73 jr 


<s ( m r), or V ■- 


m 


Thus the potential of a dipole decreases as 1 jr z and depends on the 
angle •which the radius vector makes with the axis of the doublet. 
The field is therefore no longer spherically symmetrical, hut is rota- 
tionally symmetrical about the dipole axis. . As in all the formulas 
occurring here, r is directed toward the field point. 


(d) Potential of a double, layer 

A double layer is a surface covered with dipoles, each having its 
axis in the direction of the normal to the surface. All dipoles are 
assumed to be oriented so that one kind of pole is m one aide of the 
surface. Let the dipole moment per square centimetre be wi. The 

potential at a field point P is, by (28), 

(39) 

If the layer is of uniform density (j -m | constant), 

v f , , roos(*gr)dS 

V S m\J — L r ~-. . . . . (2« ) 


Tits solid angle Ml subtended by a surface- element did as sees from P 
ig defined as the area cut out on a unit sphere drawn about P fey the 
cone determined by d8, and this has exactly the same magnitude as 

the integrand of (29'). Hence 

7 — \fn\Q.. « (29") 


According to the convention regarding the direction of r, V is positive 
if P faces the positive side of the surface. Up to now we have con- 
sidered only discontinuities in. field strength, but for a double layer the 
potential iteelf suffers a discontinuity and so the field would become 
infinite in the layer if the latter really were infinitely thin. This, of 
course, never happens in practice. 

Consider next a closed double layer, say in the form of a sphere. 
If elementary cones are drawn from an external point, each one must 
cut the layer twice, and the cosine of the angle between n and r will 
have opposite Bign at the two places (n, as usual, is drawn outward 
at all points). Thus the contributions nullify each other in pairs, so 
that outside the surface the potential is zero. For an internal point. 
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each nappe of an elementary cone cuts the surface only once: on i 
tegr&ting, the whole surface of the unit sphere is covered. The sign 
the contribution is negative, however, since n and r are opposite 
direction. Then 


V a — V, = 4tt i m 1, 


(30) 



or using the surface gradient, this may be written 

Grad V — inm, or K 0 Grad V -vn. . . (30') 

This equation holds also for unclosed surfaces. Any unclosed surface 
may be considered changed to a closed one by supplying an additional 
portion?. The potential of the added part suffers no discontinuity in 
passing through the original surface, and so contributes nothing to 
(30'). Double layers arise, for example, when a metallic plate is dipped 
into a salt solution. The term “ electromotive force ” as used to denote 
the corresponding discontinuity of potential is obsolescent. 


B. Conductors present in the Field. 

(a) Spherical cordenser 

Let there be given two concentric conducting spheres of radii 
a and b (b > a). Assume a charge e to be placed upon the inner sphere. 
The outer sphere is to be held, at a potential V b , say by means of a 
conductor connecting it to earth. We may call tins potential zero. 
Naturally, the spherically symmetrical solution of the potential equa- 
tion is again valid here, but the constants have values different from 
the previous ones. We have 

F = - 4- B. 
r 


By means of the total flux, the value of A is again found to be e\ the 
value of B is found by imposing the condition that V •- V b for r ~ b, 
whence 


Hence for the inner sphere, 



Now the electrical capacity of a system is defined as the ratio of the 
charge to the potential difference of the bounding conductors. Hence, 
for the spherical condenser, 



(33) 
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If the radius of the outer sphere is allowed to become infinite we obtain 
0 — a, or 0 ~ 4r,-A 0 a (34) 

That is, the capacity of a single isolated, sphere, expressed, in electro- 
static units, is equal to its radius in centimetres. In the M.K.S. system, 
i he capacity of a sphere, of 1 cm. radius turns out to be Til X 10 _1 * 
farad, the farad being the practical unit of capacity. 

(b) Point charge near an infinite plane conductor at zero 
■potential. Electrical image ts 

Let the conductor be the ay-plane (fig. 4), and let. the point 
charge be at » — a. In the space above the plate, the potential satis- 
fies the condition that V vanishes on the ay-plane, and that for every 
surface enclosing the point charge, 

(grad F'jdS — —dare. 

In the space below the plate, it is required that V vanish at the ay- 
plane, and also that the integral j) (grad V) dS vanish for every closed 
surface, since there is no charge in this region. These requirements 



are fulfilled if we take V ~ 0 at all points below the plans. In the 
upper space V must also vanish at the ay-plane, but EdS must be 
equal to dire for eveiy closed surface roimd the charge. In this case 
the solution may be given readily, vis. if we place a charge — e at 
the “ image-point ” of the original change, the field due to these two 
charges satisfies all the required conditions in the upper space. Hence 
in the upper region 

r=e (H)- 


(35) 
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This expression, which, is a sum of potentials due to point charges, is 
certainly a solution of the diSerential equation for the potential. 
Moreover, for the ®/-piane, r =i r', and so V vanishes there. Finally, 
closed surfaces drawn in the upper 3pace must either contain e or else 
contain no charge whatsoever, since the charge —e lies below the con- 
ducting plane. AH lines of force originating on e terminate on the 
sy-piane. Since there is no field below this plane, the field strength 
is discontinuous at cho tty -plane, and so a charge must be present on 
this plane. In order to determine this charge we first calculate the 
field strength at the surface of the conducting plane: 

F. = —grad V — ~ r' 0 . , . . (36) 

Since r = r’ at the surface of separation, the resultant is a vector Eg, 
normal to the ay-plane, having the negative ^-direction and of mag- 
nitude (see fig. 4) 

1 % | == ?| sin (9 — ~ o. . . . . . (36') 
Hence by equation (la) (p. 287), the surface density of charge is 



By computing the integral £ adS we may readily verify that the total 
charge induced, on the conducting plane is equal to — e, corresponding 
to the fact that the entire flux emanating from -fe terminates on the 
conductor. 

Ex. 71. Starting from first principles, compute the force of attraction between 
the two plates of a parallel-plate condenser, each carrying a charge of magnitude 
e. (Maha use of the linestof-force picture and neglect disturbances due to the 
edges of the conductors.) 

Ex. 72. Following the method used for a spherical condenser, calculate the 
capacity of a cylindrical condenser consisting of two coaxial cylinders of length 
t and radii a and 6 { b > a). Find also the capacity of a parallel-plate condenser 
consisting of two equal plates of area 8 placed at a distance d apart. Neglect 
edge effects. 

Ex. 73. A point charge e extern al to a conducting sphere is at a distance 
JR from the. centre. The sphere is held at zero potential. Find the potential at 
any point in the resulting field. (As in the case of the plane, the effect of the 
sphere may bo replaced by that of a virtual charge within it.) 

Ex. 74. What is the nature of the equipotential surfaces at a singular point 
in the field where E = 0 ? Take the origin of a co-ordinate system at this point 
and develop the potential in a power series in the co-ordinates. Consider especially 
the case of rotational symmetry. 
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The Electrostatic Field in Dielectric Media 

1. Formal Introduction of the Concepts “ Dielectric Displacement ” and 
“ Free Charge Boundary Conditions at the Surface of Separa- 
tion of Two Dielectrics. 

We start with the following experimental arrangement. A parallel 
plate condenser is connected to a quadrant electrometer which indi- 
cates the potential difference between the plates. First, let the sub- 
stance between the plates be air. If we now introduce into the space 
between the conductors a slab of insulating material, such as a flat 
piece of hard rubber or a glass plate, the potential difference will be 
observed to decrease: upon removing the slab, the potential will 
return to its former value. According to our definition of capacity, 
this quantify has been increased by introducing the insulator. Since 
the introduction of the insulating material could not have altered. the 
charge on the conductors, we are led to conclude that the connexion 
between field strength and charge, found in Chap. XI, § 2 (p. 265), must 
be modified if the space in which the field operates is filled with a 
medium other than air. Indeed, we find even a small effect due to 
the air if we evacuate the space between the plates, but this effect is 
relatively so small that it may generally be neglected. However, we 
shall take the results of Chap. XI to be strictly valid only for a vacuum.* 
The ratio of the field strength in vacuum to that in an insulator 
(or dielectric), for the same distribution of charge, is called the specific 
inductive capacity or dielectric constant of the medium, i.e. 

E„=-XE (1) 

We now introduce a new vector D, the dielectric displacement vector. 
The same relationships are to hold for this vector as for the vector E„ 
in a vacuum, viz. 

while in a vacuum, 

In the M.K.8. system, the factor 1/X 0 appears in place of 4w on the 
right side of the equations. 

* The effects for a vacuum and for air differ by less than 0-06 of 1 per cent at 
standard conditions. 


(p DdS —ine, divD — inp, DivD —4 rrer, 
E v dS = f.Tre, div E„ = iirp. Div E e = 47tct. 
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On account of (1), the connexion between D and 1 is given by 

D =s» KE (3) 

This simple relation between D and E is valid only for isotropic substances, 
i.e. amorphous materials and regular crystals. For certain anisotropic dielectrics, 
generally of crystalline structure, the relation (!) must be modified. In such 
substances the field strength E is a linear function of the vacuum field strength, 
i.e. the dieleotric constant is no longer a scalar, but a tensor. We shall, however, 
restrict our considerations at present to isotropic substances. 

On the other hand, the field within a dielectric (which actually 
may be measured, say by the force on a small test body) may be con- 
sidered to be the field of certain fictitious charges which would produce 
the same field in a vacuum. Such charges are called “ free ” or 
“ apparent ” charges. These charges, whiteh we shall denote by accents, 
are given by the equations 

j) E dS = 4tjre, div S = 4 770 ', Div E — 4W. . (4) 


True charges occur either as space charges or as charges on the surface 
of conductors. On the other hand, free charges are found also on the 
surface of separation of two dielectrics, as we shall see at onco. Since 
no true charges exist at the interface of two dielectrics, we have 
Div D—E x E n m _ = 0 (5) 


But this means that DivE is different from zero, since it is only the 
products of the normal components of E by the respective K’e which 
are equal. Hence the surface of separation of two dielectrics is the seat 
of apparent field-producing charges. To secure information regarding 

the tangential components of E we 

make use of the empirical fact that f-— -f — | — - 

the electrostatic field is everywhere Fjg , 

irrotational, even if there are several 

contiguous dielectrics in the region. If, then, we cam? a unit 
charge over a closed path drawn near the surface of separation of two 
dielectrics, and with one branch on each side of this surface (fig. 1), 
the line integral must vanish. This means, however, that the tan- 
gential components of E, which alone contribute to the integral, must 
be equal on the two sides: 

E (1) = E (2) . (6) 

tan tan* * * * * * ' / 

This equation may be written also in the form 

[«(E,-Sy]«OurlB-0. . 


(60 


The continuity of the tangential component of the electric field strength 
at the surface of separation of two dielectric media is not limited to 
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electrostatic fields, but is a g< 
will be justified later (p. 344) 


V 


fN. |e“ 

X 


jneral property of the electric field, and 
in a much more general way. 

The two boundary conditions (5) 
and (6) famish us with the law of 
refraction or lines of force. If we 
consider a plane (fig;. 2) determined 
by the tangent to a line of force 
in the first medium and the normal 
to the interface, it follows from 
(6) that E 2 must also lie in this 
plane. Further, if <[ it: the angle 
between the tangent to the line 
of force and the normal to the inter • 
face, it follows from Div(ifE) = 0 
that 


If, Ej cos <i, ---■ K%E S cos 
and from Curl E — 0 that 


Dividing one equation by the other gives the law of refraction for 
lines of electric force: . , „ 

t an ft, __ a. 


tan K % ' 


( 7 ) 


l±iJ: 


'Hi 


2. Polarization of Dielectrics. 

We begin with a homogeneous dielectric medium in which a 
uniform field E exists. The density 
of the field lines of the vector D 
is K times that of the field lines 
of E. We now take a small cylin- 
der whose generators are parallel to 
E and remove the contained material 
of dielectric constant K, at the same 
time placing charges on the bases of 
the cylinder in such a way that the 
field is unaltered. Inside the cylin- 
der we now have D ~ E, while out- 
Fijf. 3 side this region we still have D = ffE. 

Since E is to have the same value 
at all points, lines of displacement originate or terminate on the ends of 
the cylinder. Thus the ends carry true charges whose surface density may 
be calculated by equation (2) (p. 277). On the upper surface iu fig. 3, 
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while on the lower surface, 

Hence the volume element dr — S dl which was removed has a dipole 
moment amounting to 


dF =— 1 — -E Sdl — 3 -E dr. . . (8) 

4rr 4:77 

The dipole moment per unit volume is called the polarization P: 

p = — E = «-E, ...... (9) 

and we thus have * 

D — • EE — (1 + 4ttk)E = E + 4ttP. . . . (10) 

The newly-introduced constant of the material, k, which, by (9), 
stands in simple relation to K, is called the electrical susceptibility of 
the medium. In this manner we can replace all elements of volume of 
the dielectric by dipoles of moment /cE dr. 

The atomistic interpretation of the dielectric constant indicates that the 
vector P is the truly physical quantity, while D appears merely as an auxiliary 
mathematical quantity. As will be explained in detail later (p. 450), a resultant 
dipole moment arises in two ways in a volume element containing a large number 
of atoms or molecules. If the elementary particles are themselves dipoles (mole- 
cules with resultant positive and negative constituents), the random nature of 
their orientations makes’ the resultant moment zero, provided no external field 
acts. An applied field tends to bring the dipoles into its own direction, while 
thermal agitation tends to destroy this orderly arrangement. The greater the 
field, at a given temperature, the greater the resultant moment in the direction 
of the field. On the other hand, it is clear that for a given external field, this 
resultant will become smaller as the temperature is raised. This type of sus- 
ceptibility, in analogy with the more familiar corresponding magnetic quantity, 
is called “ pnra-eleotric ”, If, on the other hand, the elementary particles have 
normally no dipole moment, the field will produce one by distorting the electron 
shells. Of course this effeot, which is independent of the temperature, ocours 
also in the case of para-electric susceptibility, but is masked by it on account of 
its smallness. If the entire dipole moment of the particle is thus produced by the 
applied field, we speak of “ dia-electric ” susceptibility. 

If. in addition to the distribution of the true charges., we know 
the polarization as a function of position, then the electrostatic poten- 
tial may be found by integration over the volume and surface elements. 
Since according to equation (28) (p. 273), the potential of a dipole of 
moment Pdr is given by V — (P rjr z )dr, 

* In the M.K.S. system k - K — 1 and D “ l^E + P, but all tables of k are given 
in terms of the definition (0). 
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the complete potential when dielectrics are present is given by 

y-^-r+r-f +/$■*• • • • eu 

The volume integral contributed by the dipoles of the polarized 
dielectric mav be transformed into a surface integral as long as we 
are dealing with homogeneous bodies of uniform dielectric constant. 
Here divP vanishes when no volume charges are preseht, according 
to equation (8) (p. 278) and equation (10) (p. 280). We therefore 
have 

divPdr + Jv grad, ^ dr =j ~ dr. . (12) 

The field point is fixed during the integration, and the integration 
is extended over the several volume elements oi the region. Conse- 
quently, in taking the gradient, 
, ~ r o § r r c, the field point is to be held fixed, 

anuL s r ' f p-xdj- r while the “ source points ” are 
Fig. 4 moved. This is indicated in (12) 

by the subscript s. According to 
fig. 4, the gradient of r so obtained is equal and opposite to that 
obtained by moving the field point. We denote the former gradient 
by the subscript 8, the latter by the subscript /. As before, the vector 
extending from the charge to the field point is denoted by r* 

The surface integral in equation (12) can als < be given another 
interpretation: If an insulator is adjacent to a vacuum (P == 0), then 

PdS = — DivPdS, 

and by equation (10) (p. 280), since Div D= G, 



Hence there are free charges on the surface of the dielectric, as a result 
of the polarization there, and we may imagine the added field to have 
its origin in these charges, whose magnitude is given by equation 
(13) and equation (12). 

* In co-ordinate form it may be shown at once that each derivative of r with 
respect to the co-ordinates of the source point has a value equal and opposite to that 
of the derivative with respect to the co-ordinates of the field point. Thus, if the source 
point has the co-ordinates ff, ij, {, and the field point has the co-ordinates as, y, z, we 

r - Vtx-'J)* + [f- q) a + (» 
so that dr fix - - dr /3{, <to. 
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8. Simple Examples of the Electrostatic Field in Dielectrics. 

Equation (il) gives the potential, from which the field may be 
derived, if the distributions of the charges and of the polarization afc 
known. But usually the polarization is no more known as a function 
of position than is the distribution of the charges (cf. p. 268). Since 
P =--- xE, we must know the field strength E, which has been altered 
on account of the presence of the dielectric. The problem admits of 
exact solution in only a few cases. In the absence of volume charges, 
we are concerned with finding a solution of AT =.- 0 which satisfies 
the conditions that (1) V has constant values on the surfaces of the 
conductors, and (2) at the surface of separation of two dielectrics, 
Curl E=- 0 (continuity of the tangential components), and Div(A'E) — 0 
(prescribed discontinuity of the normal components of E). As an 
example of the problems permitting of exact solution we consider the 
distortion of an initially uniform field E 0 caused by the introduction 
of a dielectric sphere. We make the obvious assumption that the 
polarizabiap within the sphere is homogeneous and parallel to E„, 
but we do not yet attempt to determine the magnitude of P. We 
can show that, by suitable choice of the value of P, the added field 
arising from the dipoles of the polarized sphere, together with the 
original field, can be made to satisfy the boundary conditions. We 
may imagine the existence of the field of the uniformly polarized 
sphere to be due to a sphere homogeneously filled with charge of 
density — p and another filled with charge -J -p, the centres of the two 
being separated by ds. In order that every element of volume may 
have the dipole moment Pdr, we must have 

Pdr — pdrds, ...... (14) 



Outside the sphere (r > a) the added potential F s is, by p. 261, 
{T (JC 4.7ra 3 4 tt£5 3 /I , ds r\ 

'• “-T' + TfiriT/'T?" (16) 


Since grad (Pr) is equal to P when the vector P is constant,* the total 
field in the external space is 



= E 0 - 


i-rra- 4ira s Pr 


(18) 


* This may be seers intuitively from the meaning of the gradient, or formally 
from P r P x x + i\y + P,z — u by calculating the- components dujdx, oso., or by 
taking the x-axin parallel to P. 
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The field is therefore no longer constant in direction, for there is a 
component directed outward along the radius vector from the centre 

Inside the sphere, the added potential 
is, by equation (25) (p. 272), 

F a <r >= —2 7rp(^-~ T -') 

s + (17) 

whence E (/) = E 0 — ~ P. . . (18) 

Provided that our solution satisfies the 
boundary conditions, the internal field 
remains homogeneous, since P is assumed 
to have the direction of E. 'Thus we 
have in addition to the applied field E c 
another field of strength 



oppositely directed, whose source is the system of free surface charges 
connected with this field by equation (13) (p. 281 ): 

o' = — DivP. ...... (19) 

In the analogous magnetic case, one speaks of the demagnetizing effect 
of the boundary surfaces, and the numerical factor in (18) — 4w/3 for 
the sphere — is called the demagnetization factor ■ for this particular 
case. 

We still have the magnitude of P at our disposed in order to satisfy 
the boundary conditions. Denoting the angle between r and E 0 
(fig. 5) by 0, we have for the tangential components 

E,J g> — E 0 sin 6 — '™- P sin 6 — E tsn <,) = E 0 sin 6 — ~ P sin 9. (20) 

The continuity of the tangential components is thus automatically 
fulfillfed. For the normal components, the equation to be satisfied is 

K s E„ w = K X E 0 cos 6 - K,~ P cos 6 + K x P cos 6 

* 1 1 3a 3 a 3 


of tlje sphere (fig. 5). 

{Iff 



— P 7 E„*" = KjE 0 cos 6 — ^ K S P cos 9, . . (21) 
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Foi tlie present case, where K s — 1, K, — K, this gives the following 
relation, which may be written at once in vector form since E 0 and P 
h"ve the same direction: 


P = 


K- 1 
K + 2 ' 


(22) 


By substituting this value in equations (16) and (18), we obtain the 
field strengths within and exterior to the sphere. The former is 


(23) 


For infinitely large dielectric constant K„ vanishes. In electro- 
statics this is equivalent to the case of a conducting sphere. Hence 
putting K — oo in equation (22) gives the polarization of an insulated 
conducting sphere placed in a uniform field E 0 : 


The total dipole moment of such a sphere is then 


m — 


4wa 3 p __ 
3 


a 3 E 0 . 


m 


The polarizability, i.e. the ratio of dipole moment to field strength, is 
thus equal to the cube of the radius. 

Ex. 76. In a uniformly polarized dielectric let the material within a sphere 
of radius a be removed without disturbing the polarization at any other place in 
the remaining medium. Determine the field produced at the centre of the sphere 
by the surrounding medium. (Observe that the supplementary field may bo 
ascribed to free chargee on the surface of the sphere.) 

Ex. 76. Calculate the capacity of a spherical condenser whose dielectric has 
a specifio inductive capacity K x from r = a to r = b and one of magnitude R\ 
from r =» b to r = c (a < b < e). 

Ex. 77. A point charge is near the infinite plane face of a dielectric medium. 
Calculate the field at points outside the dielectric and within it. (Here, as at p 
275, we oan find a solution satisfying all the boandary conditions by placing a 
charge of suitable magnitude at the “ image point ” of the given charge.) 



CHAPTER XIII 


Energy and Ponderomotive Forcer in the 
Electrostatic Field 

1. Potential Energy of Systems of Charges in a given Field. 

In contrast with the procedure in the preceding section, ve consider 
the electrostatic field E to be given here, and introduce into it charges 
which are so small that they do not influence the distribution of the 
charges which produce the field. 

The potential energy of a simple point charge at a place where 
the potential of the field is F is 

« e « eV (25) 

If we have a dipole, and if the potential at the negative charge is V, 
the potential energy of the doublet is 

u d = —eV + e[V + ds gradF] = —eds E = -mE. (26) 

In a homogeneous field the potential energy can change only by 
changing the angle between m and E, and for pure translation the 
energy remains constant — i.e. a parallel displacement of a dipole in 
a uniform field involves no work. But since work is represented by 
the scalar product of force by displacement, this means that in a 
homogeneous field no force acts upon a dipole, tending to displace it. 
On the other hand, a rotation of amount 89 requires an amount of 
work M 86, where M represents the magnitude of the torque acting 
upon the dipole. Hence 

M 86 — — 8u d — — E j vn j sin#$0, 
and the tinning moment acting on the dipole is therefore 

M = -\tn\Esm6 (27) 

The negative sign means that the direction of the torque is that tend- 
ing to decrease 6. 

So far we have considered the dipole moment to be constant, but 
the case where the dipole moment is proportional to the field, (i.e. 
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— ftl) is equally important. A force eE acts upon the positive 
charge, tending to pull it away from the negative charge, considered 
fixed. At equilibrium, this force is balanced by an equal and opposite 
internal force. If the field increases by dE, the change in energy is 
equal to the change of energy of the dipole [equation (26)], plus the 
work done in displacing the charges by dr against the internal forces. 
That is, 

dtij “ -~md& — E dm -f eE dr — —tndE — aEdE. 

The total energy of the dipole induced by the field is obtained by 
integration from 0 to E: 



Ex. 78. Find the translational force acting upon a dipole of given orientation 
in a non-homogeneons eleotrostatio field. 

2. Total Energy of the Eleotrostatio Field. 

In electrostatics there are two different but essentially equivalent 
ways of considering the total energy of the field. The older method, 
which we shall call the theory of “ action at a distance >; , considers 
only the charges and the forces operating between them, without 
attempting to explain how such action is brought about. On the 
other hand, the theory of direct action locates the energy hi the inter- 
vening spc*ce also, and implies that wherever an electric field is present 
there is also a certain continuous distribution of energy. (The picture 
suggested is that of an elastic body, say the ether, in a state of stress. 
In spite of its great heuristic value, we can no longer ascribe physical 
reality to such a picture.) The superiority of the theory of direct 
action becomes apparent in the caao of rapidly varying fields, for it 
leads simply and naturally to the existence of electromagnetic 
waves. 

We begin, however, with the theory of action at a distance, and 
consider the potential energy of two point charges and e a in a vacuum. 
This is given by the potential of the Coulomb force, and so may be 
written down at once: 

• • • « 

If there are other point charges, we can calculate the total energy 
by starting with the charge 1, bringing in charge 2 from infinity to 
the distance then bringing up charge 8 from infinity until it is at 
. prescribed distances r 13 and from, these charges, and so on. The 
total work done in thus building up the field— i.e. the potential energy 
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U = ^ + . . \ + 4. \ + /?*& _i_ . . \ 

\ r 12 r 13 r U / \ r 28 r 24 / V r S4 / 

= is'r«*, otis-s'j^r- ... (30) 

f A n* » :* 47 r2t 0 r tft 


The factor 1/2 occurs before the double sum on account of the fact 
that every combination of i and k occurs twice in taking the double 
sum. The accents on the summation signs indicate that the com- 
binations * — k are to be omitted. Now 


or, in M.K.S. units, S' — 

. Uh i 

signifies the electrostatic potential, at the position occupied by the 
charge k, caused by all the other charges. Thus we may write 

Z7 = JSF*e s (same in M.K.S. units). . . (31) 


The idea may be extended to include continuously distributed charges. 
For a volume charge, e — pdr: for a surface charge, e = crdS. Hence 
the energy when volume and surface charges are present is 

U — h fVpdr -f VadS (same in M.K.S. units). (31') 


In the theory of direct action we must ascribe a given energy con- 
tent to every element of volume of the region where an electric field 
exists. The most natural assumption is that the energy density u (the 
energy per unit volume of the field) is proportional to the square of 
the electric field strength, for E 2 is the simplest scalar with which the 
scalar u may be connected. It is easy to verify, also, that E 2 has the 
dimension of energy per cubic centimetre in the electrostatic system. 
The factor of proportionality is to be determined in such a way that 
numerical agreement with the theory of action at a distance results. 
The equivalence of the expression so determined to (31') is easily 
shown. Thus, if we apply Gauss’s theorem to the product EF, aud if 
we remember that E vanishes as 1/r 2 and F vanishes as 1/r at infinity, 
the integral over an infinite sphere is zero, so that 

0 = £ (EF)dS = Je grad Vdr -\-f V div Edr. . (32) 

Since E = —grad F and div E = 4np, or = p/K 0 , 

J ¥?dr — j knV pdr, or — — JVpdr. . . (33) 


we have 
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On the other hand, if only volume charges are present, the total energy 
of the field is, by (31'), 

U = l f Vpdr. 

We obtain numerical agreement if we take the energy density in a 
vacuum to be 

« = ~ E 2 (C.G.S.), or M = pr 0 E 2 (M.K.S.). . (34) 


If surface charges — and henoe surfaces of discontinuity of E~ arc 
present, these are to be excluded from the field by closely fitting 
closed surfaces, when applying Gauss’s theorem. This is the origin 
of the second integral in equation (31'). 

In a medium where the dielectric constant is K, the force between 
two point charges is decreased in the ratio 1 : K, and the theory of 
action at a distance gives 


= k-K S' 2' 


k K r ik 


(SB) 


But e* jK is the apparent charge e k , from which .the field in the medium 
of dielectric constant K may be derived, just as the field in a vacuum 
may be derived from the true charge e. Passing to a continuous 
distribution of charge, we write 

. = f J Fp'dr+f f Vo'dS. . . . (35') 


If r»e again take u proportional to E 2 in the direct action method, the 
same transformation as was used in equation (32) gives 

« = }- KW = -- ED. or u E 2 . . . (36) 

tin 8 tt 2 ' 

In the second form, the equation is also valid for anisotropic sub- 
stances, where E and D .have different directions. 

We shall next consider the change in the eflSi'gy of a field which 
results from introducing a dielectric body. The body will be assumed 
to be so small that its introduction does not alter the distribution of 
the charges producing the field. First let there be given a vacuum 
field Eq = D 0 . By bringing in tbe dielectric body, we produce a new 
field E and a new displacement D at every point. Hence tbe energy 
change is given by 

V- U 0 = 1 J(ED- E 0 D 9 )dT. . . . (37) 
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This expression may be Prewritten as follows: 

ED — E 0 B 0 = E (D — D 0 ) + (E — E 0 ) D 0 . 

It io a fundamental property of the electrostatic field that E is irro- 
tational throughout the entire region (p. 267). Therefore we may 
set E equal to — grad F. Again, div (D — D 0 ) is everywhere zero, and 
hence D — D 0 is free of sources (“ solenoidal ”), since the true charges 
— the sources of D^-were not changed by the introduction of the 
dielectric substance. Moreover, E and D vanish at infinity as l/r a , 
aDd V vanishes as I/r. Hence Gauss’s theorem gives for integration 
over an infinite sphere: 

0~ (p F(D— D 0 )dS==— Je(D— D 0 )dr-h J F div (D — D 0 ) dr. 
Since div (D — D 0 ) vanishes everywhere, it follows that 
jE (D — J> G )dr — 0 

and TJ —n, t = ~ [CE — E 0 ) D 0 dr. . . . (38) 

077 „ 

But for the same reason, the integral J E C (D — ~D 0 )dr vanishes also. 
If we multiply this integral b.y — 1/8 ir and add it to (38), we 
obtain 

U-D.-±f (E» 0 - E 0 D )dr = ^ f (EE 0 - E u D)dv 

= — $ j /v EE 0 dr, (39) 

since D„ ~ by hypothesis. From the definitions of electrical sus- 
ceptibility and polarization (p. 280), 

27 — (Jq ~ — i J *:EE 0 dr — if'S'E^dr. . . (40) 

If the susceptibility is small, EE 0 may be replaced by E* without intro- 
ducing appreciable error; i.e. the alteration of the field may be 
neglected. If we make this substitution in the first integral of (40), 
we recognize at once the meaning of this expression. In a field E 0 . 
every cubic centimetre of the dielectric has a dipole moment kE 6 . 
and fay § 1 (p. 288) the energy of a dipole induced by the field m given 
by 



The change in the field within the dielectric, as calculated for the 
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sphere on p. 282, causes the scalar product EE 0 to appear in place 
of V- 

This decrease of the held energy for constant field-producing charges, 
easily verified in the case of a parallel-plate condenser with and with- 
out dielectric, must not be confused with the increase in the energy 
content of a dielectric when the field is applied. In the latter situation 
the field-producing charges increase. According to (36) the change in 
energy density is 

= ^ EdE = iii^EdE = i-EdE-fPdE. (36') 

4 m vn ' vtt 

The last form shows that the second term represents the change in 
energy of a material body, in which P =4= 0, while the first term repre- 
sents the change in energy of a vacuum. 

Ex. 79. Show that the energy of a charged condenser is £CF a and compute 
the change in energy when a dielectric is introduced — in one case with the charge 
on the plates held fixed, in the other case with the potential difference held con- 
stant. 

Ex. 80. Compute the work required to put together infinitely dispersed 
oharges to form a sphere having a uniform charge density p and a radius a. 


3. Forces and Equilibrium in the Electrostatic Field; Theory of the 
Manometer Method for the Electrical Susceptibility of a Liquid. 

In mechanics, the position of equilibrium of a system may be 
determined . by the condition that the potential energy . must be a 
minimum. Now, according to § 1 (p. 285), the energy of the electro- 
static field is equivalent to the potential energy of the charges, and 
so equilibrium is determined by the con- 
dition that the sum of the dectrical field 
energy and the mechanical potential energy 
(e.g. that of gravity) is to he a minimum. 
We illustrate this principle with the theory 
— ho of the manometer method for determining 
the electrical susceptibility of a liquid, which 
is especially important in the analogous 
magnetic case. A U-tube of cross-section 
S is filled with a dielectric liquid. Let 
Fi« i. an electric field, normal to the axis of 

the tube, be applied in the region of the 
meniscus in one arm of the tube (fig. 1). The lines of force are 
normal to the plane of the diagram, their trace in that plane being 
indicated by dots. Application of the field causes the meniscus 
in this side of the tube to rise. If the density of the liquid is p, 
and if the equilibrium height of the meniscus in the absence of the 
field is A 0 , the work done against gravity amounts to Sp(h — 
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for the mass Sp {h — h 0 ) has been moved from the right side to the left, 
whereby the centre of gravity has been raised a distance h — h 0 . If 
the susceptibility is small, so that E may be set equal to E 0 , the total 
energy becomes [of. equation (40) (p. 289)] : 

U « r/ 0 - i fkBfSdh 4- 8p{h - h 0 fa. . . (41) 

J o 

hence the 


. (42) 


This means that the total energy depends only upon A; 
■minimum eoncution gives 

0 - § - H*M,S -f 2 S P (h - h D )g, . 


and the difference in level of the menisci is 




2 (h-K) = 


2pg‘ 


(43) 


As is evident from the analysis, E 0 signifies the strength of the field 
applied to the meniscus on the left; the field need not be measured 
at other places. 

In many cases it is not so important to know the position of equili- 
brium as it is to find the force actually operating on the system. For 
example, we may wish to find the motion of an uncharged oil drop in 
a non-homogeneous field, and must therefore know the forces acting. 
The droplet is assumed to be so small that the' field may be con- 
sidered homogeneous within the spherical volume (radius a) which 
it occupies. Then, by equation (22) (p. 284), the polarization within 
the sphere is 


3 K -1 
iir ~K + 2 




and by the exact formula (40) (p. 289), the electrostatic energy is 
U=U, ~\j FE 0 dr « Z7 0 _ * a*PE rt - U 0 ~ ? E 0 h (44) 


Since we assume the droplet to be so small that we may reckon with 
a constant value of E 0 within a volume of this magnitude, the energy 
of the system will depend on the position of the centre of the sphere. 
If the field E u is given, there is a definite value of the energy corre- 
sponding to each position of the centre of the droplet. Points at which 
this value is the same may be connected by level surfaces. If the 
sphere undergoes a displacement ds, the change in the potential energy 
of the field is 

dU = (grad V)ds = — a ~ (grad E *)ds. . (45) 
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Ob the other hand, the force acting on the sphere does work 

dW = Fds. 

Since this work must be equal to the decrease in potential energy, 
we have • 

g»av. («k 

Since K > 1, the direction of the force is that in which E 0 2 increases; 
hence the droplet is urged toward regions of greater field strength. 
In magnetism, where we shall find far-reaching analogies with electro- 
static phenomena (Chap. XY), the magnetic permeability /a is the 
counterpart of S. We encounter values of p less than 1 ae well as 
greater than 1. depending upon the nature of the substance. Spheres 
of those substances for which p < 1 will he urged out of the 
magnetic field, and this difference in behaviour has led to the 
classification of magnetic materials into paramagnetic and dia- 
magnetic substances. 

Ex. 81. The Absolute Electr.ymeler . — A flat metal pis te of area & is suspended 
from one of the two equal arms of a balance sc that it is at a distance a from 
another horizontal plate of the same size. What weight must be added to the 
other side of the balance to maintain equilibrium if the two plates are charged 
to a difference of potential V when the medium between she plates ha (a) air, 
and (6) a material of dielectric constant K ? 



CHAPTEB XIV 

Stationary Electric Fields and Steady Currents 


1. Ohm’s Lav?. 

We know from what has gone before that it is impossible to main- 
tain an electrostatic field in a conductor. However, by continuously 
supplying energy, a field which is constant in time can be main- 
tained. This field differs from the electrostatic field in that, in the 
first place, it is accompanied by a continuous evolution of heat. 
But there is .another difference : an actual transport of electric charge 
takes place in a field of this kind, as may be verified directly, for 
example, in the case of electrolytes. This continued motion of electric 
charges, which we shall refer to as an electric current, is also made 
evident by the magnetostatic field which accompanies it; this will be 
discussed in the following chapter. 

The current strength, or simply the current I, is defined as the amount 
of electricity passing through a cross-section of the conductor in unit 
lame: 



The current density i is a vector in the direction of motion of the 
electricity, whose magnitude is equal to the amount of electricity 
passing per second through unit area of cross-section normal to i. 
Then ’the current through any section is given by 

7 = J ids = f\i\ cos (i, n)dS. «... (2) 

It has been found experimentally that for metals and electrolytes 
the current is proportional to the applied difference of potential (or 
RM.F.) V, We write 1/22 for the factor of proportionality, and call 
R the resistance of the conductor. R may depend upon the temperature; 
hence Ohm’s Law, 


assumes that precautions are taken to hold the temperature constant 

2S* 
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in spite of the evolution of hea.t accompanying the passage of the 
current. In many instances the form 

V = IR (4) 

has mere direct physical meaning. This way of writing the equation 
states that if we have any conductor carrying a current, the potential 
difference between two points between which there is a resistance R 
is IR. If we have a closed, unbranched circuit consisting of various 
resistances, the first of Kirchhoff’s laws of electric circuits says that 
the algebraic sum of the potential differences across the conductors 
is equal to the sum of the electromotive forces (batteries, generators, 
&c.) applied in that circuit. The other law of Kirchhoff states that at 
any junction, the sum of all inflowing currents must be equal to the 
sum of all outflowing currents. This is merely a special case of the 
absence of sources or sinks (solenoidal property) for electric currents, 
and is almost self-evident. 

For the theory of direct action we require a formulation of Ohm’s 
Law which connects only the quantities localized at one particular 
point. For a cylindrical conductor the resistance R is proportional 
to the length and inversely proportional to the area of cross-section. 
The resistance offered by a centimetre cube of material to a current 
passing between opposite faces is called the specific resistance or resis- 
tivity p of the substance; its reciprocal, a, i3 called the conductivity. 
Hence the expression for the current in a cylindrical conductor may 
be written 

( 5 ) 


Now V/l is equal to the field in the direction of the generators of the 
cylinder and I/S is the current density. Since the current is in the 
direction of the field for isotropic bodies, these considerations hold for 
any small cylinder drawn within a conductor carrying a current. Hence 
(5) may be written in the vector form 

i — £tE (6) 

The fundamental implication of Ohm’s Law is that, for conductors 
such as metals and electrolytes to which it applies, a is independent 
of the field strength. 

The electric field strength is an irrotational vector in current- 
carrying conductors, just as it is in electrostatics. Also, there are no 
surface vortices on the surface of separation of two conductors: 

CuriE = [w(E a — By] =0 or E tan ® = E t . (7) 

But the normal component of E experiences a discontinuity at the 
interface; the surface divergence of i must vanish, sinoe any charge 
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which enters the surface from one ride must emerge from the other, 
so that 

Div i = n (i 2 — i x ) = n{u z E 2 - a x BJ = 0 or a 2 E„< 2 > = E„ (1) . (8) 

Thus the change in direction of the lines of flow at the surface- of 
separation of two conductors is the same as that of the electrostatic 
lines of force at the surface separating two dielectrics (p. 279): 

* 223*3 ( 9 ) 

tana a (t 2 

Likewise, no lines of flow can originate within the conductor, so that 
we have 

dive — div(crE) — — adivgradF — — oAF = 0. . (10) 

This means that inside homogeneous conductors V satisfies the same 
differentia] equation as it does in the electrostatic field in free space. 
Moreover, the boundary conditions are also the same, except that the 
constants have a different meaning. This property of steady fields may 
be used to map the field experimentally. If it is desired to determine 
the static field about a body of arbitrary form, this can be done by 
immersing a metallic model in a large metal basin filled with an elec- 
trolyte. The body is connected to one pole of a source of potential, 
and the basin to the other. With the help of a Wheatstone bridge the > 
equipotential surfaces may be determined with ease. Since the con- 
ductivity of the solution is small compared with that of the metal, 
the condition that the surface of the body be an equipotential surface 
is practically realized. 

Ex, 82. A condenser of capacity C is at the potential F 0 at time t = 0. If 
it is discharged through a resistance R, calculate the resulting ourrent. 

Ex. 83. A oondenser G is charged through a large resistance R by means of 
a source of direct potential V 0 . A glow lamp having a striking potential V, is 
connected in parallel with the oondenser. What is the frequency of the resulting 
pulsating current? 


2. Generation of Heat in a steady Electric Field. 

According to p. 267, the work done when an electric charge e is 
moved from a point where the potential is F, to a point at potential 
F g is 

W = e(V x — FJ (11) 

If the carrier of the charge is free to move, this work appears as kinetic 
energy; in steady fields within conductors, the work is transformed 
into heat. The atomistic view of conduction interprets thiB heat as 
that arising from “ frictional ” motion of the charges. Hence, in a 
section of a conductor bounded by two equipotential surfaces at 
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potentials V t and F 2 , the amount of heat generated per second by a 
current I is 

W — I (F i — F 3 ), ...... (12) 

measured in energy units, since 1 is the quantity of electricity which 
passes through a potential difference V t — F 2 each second. Using 
Ohm’s law to eliminate the potential difference from (12), we have the 
usual form of the equation for the so-called Joule heat los3, 

W-^PR (12a) 

From the view-point of the theory of direct action, a differential 
formulation is desirable. Let us take a volume element in the form 
of a small cylinder whose lateral surface is formed by lines of flow and 
whose ends are portions of equipotemial surfaces. If the altitude of 
the cylinder is h and the cross-section is 8, then the potential difference 
between the end surfaces is 

Fi — V z — h\ grad F j = A j E j (F x > F 2 ). 

The current is I = S j i |, so that the quantity of energy dissipated as 
heat in the volume element dr each second is 

dW — j * J | E j S/i — oE?dr — — dr. . . . (13) 

It may be shown without difficulty that this equation is true for an 
element of volume of any kind. 



CHAPTER XV 

Th® MaGSDSTOSTATIO PlEU) 


1. Comparison of FSoctrosfcaSio and Magnetostatic Fields. 

Besides electrical phenomena, there is a second type of non 
mechanical effects whose close connexion with electricity was recog- 
nised at an early period, viz. magrutic phenomena.. Just as an electro- 
static field can be maintained without energy supply, so there exist 
also magnetic fields which persist without the necessity of supplying 
energy. The field between the poles of a permanent magnet is of this 
type, also '••he magnetic field which, as we shall see, accompanies a 
steady curie ot. These are included in the classification magnetostatic 
feld. It ia true that energy is dissipated continuously in the latter 
case, but this ia merely a consequence of the finite resistance of the 
conductor, and has nothing to do with the magnetic field, whose 
strength depends solely upon the current. If- the resistance of a con- 
ductor could be made arbitrarily small — « condition which is almost 
realised at very low' temperatures (ci p. 449) — these magnetostatic 
fields could also be maintained without consuming energy. 

If, in analog 7 with the electrostatic field, we try to map the field 
by using a magnetic tost body, we encounter the first difference be- 
tween eketrostatio and magnetostatic' fields: no test body exists 
having an independent magnetic chargs, Sven the smallest fragment 
of a steel magnet is a dipole which behaves in «. magnetic field much 
as an electrics! dipole does in an electrostatic field. Hence we can 
explore the field only by observing the torcp .0 .-carted upon a small 
magnet of know;’; magnetic moment Prom the calculation for the 
corresponding alee; Heal case (p. 26://. this . evqi . ’■» 

l M i - - i m j j H i sin 9, or m venom: fcan, A ** [wH] , (1) 

where H is the magnetic field strength. The position of equilibrium is 
that for whs-ib 8 ■■■• 0; for small displacements, the twisting couple, or 
‘ moment c; torsion ”, is 

t « i $>* 1 !H| - . (10 

it fcfie momenv of inertia of the small car magnet is known, r may be 
determined erperimenrally by observing' the period of vibration in 
.he field [use equation (20) (p. "* 4.S) J This fa the well-known Gauss 
method for determining the bonbon cal intensity »!.' the earth’s magnetic 
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field. In order to obtain the field s trength H from the value of the torque 
M, we must also know the moment wt of the test magnet. Since the 
fields of magnetic 'systems follow the same laws as those of electrical 
charges, we can calculate the field of a magnetic doublet by the results 
of p. 272; then, by observing the forces experienced by other dipoles 
in this field, we can obtain a second equation connecting the two 


unknown quantities m and H. A detailed description of such an 
experiment may be found in any standard treatise on electricity and 
magnetism.* The absence of isolated magnetic charges, or poles, implies 
div H —• 0 (2) 

at all points in a vacuum. The magnetostatic fidd in a vacuum is every- 
where solenoidcd (free of sources, or non-divergent). The electrostatic 


field, which may have a finite divergence even in a vacuum, is never- 
theless characterized by complete absence of curl. This is by no means 
always the case for a magnetostatic field, as may be seen from the 
simple example of the field about a straight wire carrying a current. 
As we know from elementary experiments, the lines of force in this 
case are circles whose centres are at the wire, their planes being normal 
to the wire. If we take the line integral' <j) 'Eds along one of these 
circles it cannot vanish, since we are continually adding quantities of 
the same algebraic sign. Hence the iixotational condition is no longer 
fulfilled by the magnetic field, * QX paths encircling lines of flow. Thus, 
while the electrostatic field in a vacuum must he everywhere irrotational 
but may be divergent, the magnetostatic field in a vacuum is non-divergent 
at all points, but may have a finite curl. 

In the same way as the electrostatic field between the plates of a 
condenser is reduced by a factor K when a medium of dielectric con- 
stant K replaces the vaeuuin between the plates, so the field between 
the poles of a magnet is changed by the introduction of a medium 
having different magnetic properties. The quantity corresponding to 
the dielectric constant K, which is analogously defined as the ratio 
H*ac : H, is called the magnetic permeability p. The quantity /*H cor- 
responding to the dielectric displacement D is called the magnetic 
induction B. As in the electrical case, the effect of the material may 
be represented by a polarization of the medium.' 1 The magnetic quantity 
corresponding to the electrical magnitude P (the polarization) is called 
the intensity of magnetization 

M = /cH = [(p. — l)/4rr]H, ..... (3) . 
where k is the magnetic susceptibility. The intensity of magnetization 

* If the force is measured in C.G.S. units, m and H will thus be given in this system. 
The unit of magnetic field strength so defined was formerly called the gauss but is 
now designated the oersted. In the M.K.S. system th6 field is specified in terms of the 
current producing it (cf. § 2). 
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is also aqua! to the magnetic moment per unit volume. The mag- 
netic susceptibility diiiers in an important way from the electric 
susceptibilit} : there are substances for which k is negative as well 
as substances lor ?.|hich it is positive. In contrast with electric polari- 
sation, which is always in the same direction as the field, the mag- 
netization may be either in the direction of the field or in the opposite 
direction. But even in the electrical case, we found (p. 280) two dif- 
ferent ways in which the polarization may come about. The same 
two 'possibilities exist in the magnetic case also, except that tho 
difference is far more apparent here, inasmuch as the one land (align- 
ment of the already existing dipoles) yields positive values of k (para- 
magnetic substances), while the other kind (creation of dipoles by 
induction) gives negative values of ;e (diamagnetic substances). While 
the electrical susceptibility of liquids and solids lies between 0*1 and 
10, the magnetic susceptibility, with few exceptions, amounts only to 
about 10"~ 4 or even less But for the so-called ferromagnetic metals, 
which are mostly metals of the iron group and certain alloys, * assumes 
enormously greater values (of the order of 10 s ); however, te is not 
constant, but depends upon th. strength of the applied field. This 
fact may be understood from the way in which paramagnetic sus- 
ceptibility is believed to arias.' if the resultant dipole moment may 
be ascribed to the alignment of existing dipoles, then this moment 
cannot remain proportional to H while H increases indefinitely, since 
an upper limit is attained when all the elementary dipoles finally hwf> 
been brought into line with the field. 

After this digression on the subject of the atomistic interpretation 
of susceptibility we return to the description of the experimental pro- 
perties of the magnetostatic field, which is wholly analogous to that 
for the electrostatic field. On account of the absence of true magneto 
charges, we have always 

div B = div (pH) — 0 and Biv B — — g. s H„‘ 2 > = 0. (4) 

On the other hand, H is derived from apparent (fees) charges residing 
in non-homogeneous parts of the media: 

div H = 4 t rp m * Div H — 4wcr OT (5) 


If no finite (“ surface ”) current flows in the surface of separation of 
two media — a case which we exclude — then Curl H — 0. Hence the 
refraction of the lines of force at the interface is governed by the 
tangent law 


tan ctj /Xjj’ 


(6) 


Just as there are no true single magnetic charges, so there exist 
no conductors of magnetism. On account of the high value of g. for 
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ferromagnetic substances, tbe lines of force reaching these are very 
nearly normal to tbe. surface, as may be seen from (6). This means 
that such substances behave very much like conductors, with regard 
to the static arrangement of the lines of force. 

The expression for the magnetic energy density also corresponds 
lx. that for the electrostatic case : 

u ttaa = H 2 = - 1 BH, or =r in M.K.S. units;’" (7) 

* o77 8n 2 

hence the calculations of § 3, Chap. XIII (p, 290), may be applied to 
magnetism, aa already mentioned there. 

Ex. 84. As a first approximation, the earth may be regarded as a sphere 
magnetized with uniform intensity. Calculate the angle of dip as a function of 
the magnetic latitude. 

Ex. 85. In a certain terrestrial magnetic survey, an area of J3-1 x Ip* kci. 3 
was enclosed by a path from New York to England and back. The path was 
described in the counter-clockwise direction, as seen from above. The result was 
that (j) Eds — —64 x 10* oersted cm. What does this mean? 

2. Calculation o X, the Magnetostatic Field accompanying a Given Dis - 
tribution of Electric Currents in a Vacuum. 

In elementary physks the magnetic field due to steady currents 
is calculated by means of a formula from the theory of action at a 
distance. This Saw, which gives the magnetic field due to an element 
of a conductor, is the familiar rule -of Biot and Bavarfc (p. 303). How- 
ever, this law is not a suitable basis for obtaining a more general re- 
lationship corresponding to the direct-action point of view; moreover, 
the Biot-Savart rule cannot be verified experiments ily, since there 
are no isolated elements of conductors — at least for steady fields — 
but only closed circuits. Hence we start from a different empirical 
fact. If we snap the magnetic field of a conducting wire, as outlined 
on p. 297, and calculate the line integral (p Eds along any arbitrary 
curve which encircles the conductor, we obtain the same value, pro- 
portional to the current in the circuit, no matter how we choose the 
path of integration, as long as the circuit threads it: 

< f>H.ds~yl . (8) 

If we measure H in Gaussian magnetic units [the connexion is given 
directly by the Gauss method of determining the field (p. 298)], and 
if 7 is met- a- red in the electrostatic system, then, in order to make the 
equation numerically correct, wo must take the value 


* Tee quautity fi,„ wSJ be defined on p. 12 , 
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where c is numerically equal to the velocity of light in a vaouuin, 
expressed in cm. /sec. If, on the other hand, we take y = in, we 
thereby determine a new rmit for I — the electromagnetic O.O.S. unit, 
which, however, we wish in principle to avoid using in spite of the fact 
that the practical units such as the ampere and the volt differ from the 
electromagnetic C.G.S. units only by powers of ten. La the M.K.S. 
system there is no specific unit of magnetic field strength, but the field 
is defined by setting y — I in equation (8). Thus the field is specified 
by the current producing it, and has the dimension amp./rnsfcre, 

We wish to bring equation (8) iato a form consonant -with the theory 
of direct action. Imagine that we have given an extended conductor 
in which the current density vector i is known as a function of posi- 
tion. If we draw a small closed curve G inside this conductor, the 
current passing through it is 

I = idS, 

where dS denotes the directed element of area bounded by C Thorn 
the definition of our], 

H is ~ (curl R) <iS 


for this element of area. According to equation (8), this must be 
equal to 

4 ml __ ini dS 


Since this relationship mast hold for any element of area whatsoever, 
for slowly varying fields 


curl H — 


4w* 


, (9) 


or m M.K.S. units, curl K = oJEL 


O') 


This is the differential form of the law corresponding to theory 
of direct action. The vector equation states that the field lines of 
the current density vector are the same as those, of the vector curl B 
Since the divergence of a curl is identically zero (p. 28). the lines of 
flow can neither originate nor terminate in the finite region. They 
must either form dosed curves or go to infinity. 

The problem for the electrostatic field was to deduce an motatiomd 
field from a given distribution of sources, or charges. This task was 
simplified by the introduction of the electrostatic potential, which 
could be computed readily by integration over all the charges. Ie 
the complementary problem now before us — the calculation of 
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solenoidol or non-divergeni, field from tlie distribution of the vortices 
(currents) — we likewise simplify our task by introducing an auxiliary 
quantity. This quantity is no longer a scalar, but a vector, which we 
call the vector potential. We take 

H = curlV. ....... (10) 

The non-divergence of H is thus provided for. Than by equation (98) 
(P- 40), 

curl H — curl curl V = grad div V AV — — . (11) 


Without loss of generality, we can assume that divV vanishes; for if 
it did not do so, we might add to ¥ an irrotational vector having the 
sapie sources as V, but with opposite signs, without changing H in 
any way. Thus we obtain the same type of equatiou for the vector 
potential as for the scalar potential in electrostatics: 


In component form (cf. p. 40): 

AT/ 47ri B A ^7 


_ 


( 12 ) 

(IS) 


The solution of this set of equations is known from electrostatics: 
if we are dealing only with currents distributed in space (surface cur- 
rents, corresponding to surfaoe charges in electrostatics, are tem- 
porarily excluded), the solution is 

F.-ris* . a*) 

J or J cr J or 

The component forms may be combined again into a vector equation: 

V-/£. (15) 

By means of this equation the magnetic field of a linear conductor 
(wire) may be determined at once. Here the vector i is in the direc- 
tion of the axis. Let the element of length ds be in this direction 
also, and let the cross-sectional area of the wire be S. Then 

*dv = ~Nds, ( 16 ) 

and thus, since I has the same value at all points of the conductor, 

'-Jr ™ 
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The integration is to be extended over all the elements of the con- 
ductor and has nothing to do with taking the curl of V, in which 
operation the co-ordinates of the field point are differentiated. 

< 18 > 

The factor 4rr/c on the right side of (12) is absent in the M.K.S. system, 
so that in these units T rJ •> 

w 

Here r 0 is a unit vector drawn from the segment ds toward the field 
point. We can interpret this equation to mean that each element of 
length of the wire contributes an amount 
/ [ds, r 0 ] 
a r 2 

to the field. But this is precisely the Law of Biot and Smart, according 
to which the field at F due to the current in the element ds is normal 
to the plane determined by ds and the radius vector from ds to P, 
and is of amount 7 ds sin 6joF. 

Again, it may be shown without much difficulty that a closed 
linear current (circuit) gives rise to the same magnetic field as an 
arbitrary surface covered by magnetic dipoles which have the direc- 
tions of the normals , to this surface, and which have a magnetic moment 
of Ijc per unit area, the surface being bounded by the circuit. A sur- 
face thus uniformly covered by normally set magnetic doublets is 
called a uniform, normally magnetized shell. It is, of course, further 
understood that the poles of one kifid are all on one side of the shell. 
This equivalence of a magnetic shell and a current may easily be 
proved with the help of certain transformations of vector analysis. 
We must be careful to note whether the field point or the element of 
the conductor is the element which varies. All differentiation referring 
to a displacement of the field point will be characterized by the sub- 
script fl derivatives relating to variation of the element of the con- 
ductor will be denoted by the subscript s. We again note, as on p. 281, 

that grad,/(r) = — grad 5 /(r). 

First we transform the expression for the vector potential at the 
point P by a generalization of Stokes’s theorem. Let u be a scalar 
point function, and let it be required to transform the line integral 
<j) uds into a surface integral. We multiply scalarly by an arbitrary 
constant vector a and obtain the following result by using Stokes’s 
theorem and equation (89) (p. 38); 
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a f uds — <f uads —J [curl iua) ] dS -~ 

f [gradu, a]d8 — afld&, grad u]. 

Since a is perfectly arbitrary, we must have in general 
fuds j [d£, gradu]. 

If we apply this equation to the vector potential as given by equation 
(17) (p. 302), we must remember that the field point is fixed, and 
hence all derivatives relate to change of the elements? -of the conductor. 
Hence 

V — ~~ If j~dS, grad,- J — ~ J |V*„grad,~ jd$, (19) 

If now we compute the field strength H by taking the cmi, the field, 
point is to vary, so that 

H — ^ curl / f \ n, grad, £ j dS —■ -^fcuxl^n.. grad,- jcfe? 

*= — ^ f curl/ j n grad/ ~ j dS. 

! rhe integrand may be transformed by introducing the ^-operator: 
curl/ j%*. grad/- ] «= [V/Tm V/^J j 

»* n . V f Vf - - V, • n V s f . . (20) 

Since, however, V/ 7/ = A/ - --- 0, we obtain 

3S -j- f grad/1 -f- - n, grad/i \dS 

J \ e r/ 

^-grad//{j ^48. . . (21) 

Hence tbs magnetic field is derivable .from the. scalar potential 

- - • w 

where r is the radius vector drawn to the field point from the element 
dS of any surface bounded by the conductor. Comparison with the 
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expression far the potential of a double layer [(29'}, p. 273} shows 
that the dipole moment per unit area is Ijc and that the direction of 
the dipoles is that of the normal *» to the surface, the sense being 
such that the current flows in the clockwise direction when sighting 
along n In M.K.S. units the dipole moment per cm. 8 h 

But the potential has different values on the two sides of the sur- 
face, since the integral j) W&s must amount to itrl/c- for a complete 
circuit, and the contribution of the arbitrarily small space occupied 
by the shell must vanish. This confirms the result found .m p. 274, 
that the potential hits a discontinuity amounting to 4ar j m j at a 
double layer. The result of (21) may be stated; At points outside any 
surface bounded by a dosed circuit, the magnetic field is that which would be 
produced- by o, double layer covering this surface uniformly* The surface 
may have any form. It represents a surface of discontinuity such that 
or passing through it the potential must be increased by irr j tn j. 

Ex. 8(3 Calculate the magnetic field at interior and exterior points of an 
infinitely long solid eyliodrical conductor of radius R. 

3. Calculation of the Magnetic Field accompanying Slecsric Currents 
when Ferromagnetic Materials are present. 

If we fill the region about a current-carrying conductor with bodies 
of various permeabilities, experiment shows that the line inter al 
:p Bids 'taken over any dosed curve continues to have the value iirlfc, 
rhere 7 is the hotel current enclosed by the path of integration. It 
follo ws that equation (9) (p. 301 ) is independent of the permeability of 
the medium. Hence, hi contrast with electrostatics, where the field due 
to charged bodies is reduced by a factor K by filling the apace with 
a medium of dielectric constant K, the magnetic field of currents re- 
mains unaltered if the entire space be filled with a medium of per- 
meability ft. This is true, however, only provided the entire region, is 
thus filled (see below) 

Since, os the other hand, the formal application, of the theory 
developed for electrostatics requires that the magnetic field of ficti- 
tious single charges or dipoles be reduced by a factor y in & medium 
of permeability ft, we m> st increase the strength of the equivalent 
magnetic shell of the circuit by this factor if H is to remain the same, 
when the permeability is changed to a. Thus we must give the shell 
a moment, ulje per mat area, or in M.K.S. units, mu,!. 

If, however, we place ferromagnetic materials in certain portions 

* This v’ns stated by Ampire in his Thtorit its phtwminss 4leclro-dynainu/uefi, Mim- 
circs it 1‘lnstHut, IV (1323), to fellows: “ Every Honan oonduotor carrying a current 
is equivalent to a simple magnetic shell, the bounding edge of which coincides with 
the conductor, and whose moment per unit area — that is, the strength of the shell — 
in proportional to the strength of the ourreat.” 
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of the region only, a change in the field occurs in the magnetic oase 
also, and this change is considerable on account of the high value of 
the permeability. Since equation (9) (p. 301) is independent of the 
medium, the field H and the field Hq in a vacuum must have the same 
vortex lines, vis. the lines of flow of the current, which are the field 
lines of the vector i. Hence the field to be added to Ho, viz. H — Eg, 
must be irrotational and hence derivable from a scalar potential. It 
naturally suggests itself that this scalar potential may be Bimply that 
of the dipoles of the magnetized bodies. This is really the case, and 
may be demonstrated as follows: Let us assume the usual case where 
we deal with individual ferromagnetic bodies, each of uniform, per- 
meability. Then by § 1 (p. 299), there exists no volume divergence of 
H, but there is a surface divergence at the bounding surfaces. This 
divergence is connected with the apparent surface charges whose 
surface density is a m ' by the relation 

Div H — Div (H — Up) = 

On t^e other hand, on account of the absence of spatial source, 
(charges), 

div H = div (H — Hq) ~ —div grad F — — AF — 0. 

The solution of this equation is known from electrostatics. If the 
only sources are those distributed on surfaces, this Solution is 

Now since DivB is always zero (absence of true magnetic charges), 
we have, from (3), p. 298, 

Div M — — i Div H = — <r m '. 

4?r 

Hence the potential, from which the supplementary field is derived 
becomes 

V = ~J dS (23) 

The integration is to be extended over the surfaces of all the ferro- 
magnetic bodies. Since these must be separate closed surfaces, we 
have the added potential in the form given for the electrostatic case 
at the end' of § 2, Chap. XII (p. 281), Using Gauss’s theorem, and 
reversing the argument used there, we can represent the potential as 
an integral over the volumes occupied by ferromagnetic substances: 

F = (24) 
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But tlie last expression is exactly tie potential of the dipoles of the 
magnetized bodies; this is the fact we proposed to demonstrate. 

If the field which produces the magnetization M were not itself 
altered by the presence of the ferromagnetic substances, calculation 
of the field would be a simple matter. In that case, one could first 
compute the vacuum field strength H 0 from the distribution of the 
currents, then determine from this the magnetization of the bodies 
introduced, and thus calculate the supplementary field of the dipoles 
from this result. How far from the truth this result would be is shown 
by the example of the sphere in an originally homogeneous field — 
a problem which we already treated for the electrostatic case. Accord- 
ing to equation (23) on p. 284, the interior field which determines 
the magnetization is no longer H 0 , but 

<“> 

so that for ji = 2000, H {7) is only about 0-16 per cent of Ey. 

As we, saw above in the electrostatic 
case, the exact calculation of the field is 
possible only for bodies of simple form. 
A case of practical importance, viz. the 
field between the poles of a ring-shaped 
electromagnet, will be calculated approxi- 
mately in another way. We choose a special 
kind of ring-magnet consisting of a ring or 
tore of ferromagnetic material spirally wound 
uniformly over its whole length with a wire 
carrying a current (fig. 1). From the sym- 
metry of the arrangement we see that 
the magnetic lines of force must be circles 
whose centres are on the axis of symmetry of the ring, and which lie 
in planes perpendicular to that axis. If we take the integral <ji Hds 
along a circle K 1 or 2T 2 lying entirely outside the ring, we see that this 
integral is zero, since no lines of flow are threaded by the path of 
integration. On the other hand, for such a circle within the ring, each 
of the N turns is threaded once, so that the integral over the entire 
path is 47 tNI/o. Thus the field is confined to the space within the 
windings and is of amount 

H — 4xrNI/(2mc) = 2 NIj(ac), or H — NT /(2m), (26) 

where a is an average radius. The M.K.S. measure of the field is 
especially vivid, for NI /(2m) is merely the number of ampere-tums per 
metre. If, as we shall assume, the cross-section of the ring is small 
compared with its diameter, the field within the ring will be approxi- 
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mately uniform. We now out a section of thickness l a out of the ring. 
How large is the field H. in the gap thus made ? If we assume that the 
lines of force retain their circular form, we can compute E a very easily. 
Actually, a considerable dispersion of lines of force takes place; the 
field in the neighbourhood of the gap is no longer confined to the ring. 
If we neglect this scattering, then, H ( being the field within the iron, 
we have 


jxEi— H a 


(27) 


on account of the continuity of the normal component or B at the 
iron-air surface. Again, if the portion of the circle within the iron is 
of length !- it the Hue integral of the magnetic force is 


IMi + , 


l a + hit* 


(28) 


The actually measured field H a is smaller than this, on account of the 
spreading of the lines of force. The ratio H a :S a \a called the coeffi- 
cient of scattering. 


For 


cross-section 8, the flux in the air gap is 




4rcA~7 /c 

rjs+y$' 


In electrical ergiaeoring, this formula is brought into forms! correspondence 
with Ohm’a Saw hy calling inNI/o the “ magnetomotive force”* and calling the 
denominator th t- “ magnetic reluctance ”, Although this analogy is purely 
formal, it furnishes a practical rule for computing “ magnetic circuits If, for 
example, we have several different cross -sections, we obtain the flux roughly by 
using the expression '2ljfo j 8 f for the magnetic reluctance. 


4. Pondercmc&ve Forces on Conductors In a Magnetic Field. 

The calculation of the forces acting upon current-bearing con- 
ductors in a magnetic field is eonsiderabijr 

'gtoyplifieji by making use of Ampere's 

1_'_£ theorem concerning the equivalence of a 

— ~ H *•' magnetic shell and a current (p. 80C). 

— — I — — X Using the formulae derived in our fcrsat- 

a- ■*" ment of electrostatics, we then cats give 

' M immediately the potential energy of the 

shell. The work done by the force m ob- 
tained by calculating the energy change 
accompanying a slight displacement, and from this we arrive at the 
force itself. We apply this method to two important cases : 


* In M.K.8. units this is merely N/ r i.e, the number of ampere- inns. 
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L The moving-coil galvanometer. A cylindrical coil of N turns 
is suspended in a uniform magnetic field by a torsion fibre which is a 
continuation of a diameter of the coil (fig. 2). If a current - 1 flows in 
the coil, the magnetic moment of the coil in air is NIS/o, where $ is 
the area enclosed by each turn. The magnetic moment vector is normal 
to the plane of the coil. If this moment makes an angle 6 with the 
field H, then by equation (27) (p. 285) the torque acting on. the sus- 
pended system is * 

NTS 

— — H Sind. (23) 


'This moment tends to turn the c 
K is parallel to H. The torsion of t 
If 0 — st/2 js the rest position wh< 
a maximum. If the displacements 
Since the restoring moment of th 
placement < 

\\\\\\\\ tion is deto 

' 


» the stable position where 
•ension opposes this turning, 
mrrexit flows, this torque is 
nail, we can take sir- 6 1. 

is proportional to the dis- 
1 — 6, the equilibrium posi- 
by 


4 

\\W-- \V\ where t is the moment of torsion of the fibre. 
\'dr Y\\Y\H 2. Force on a segment of a current-carry- 
YVwXhVX ing conductor in a magnetic field. Imagine 
Fig. s a segment ds of a circuit which may be slid 

back and forth (fig. 3). Ws seek, to deter- 
thc force which the uniform field B exerts on this segment, 
yipere’s theorem and (26), p. 285, the energy of the entire circuit 
air (? 1): 

V = - 1 j HrfS m - *- U f ds «. -- ; S3. . (30) 

: area is slightly changed by giving ds a disj-lacsmenfe 8 a % the 


NSE 


I, . . (29') 

rsion of the fibre. 


W — - -• H8S- 


The vector 86 may be represented by the vector product [8a ds]. If 
the factors are taken in this order, the normal is obtained in the right 
direction.* This displacement is accompanied by ar- amount of work 
FSet done by the force. This work most be equal to the decrease in 
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so that the force itself is 

F==~[dsH]==~[z'HJc&r, . . . . (32) 

where dr is the volume of the part of the conductor of length ds. 
This force, tending to enlarge or decrease the area of the circuit (de- 
pending upon the direction of the current), is always present, whether 
it causes the conductor to move or not. If the current in a coil is very- 
large, the magnetic field of the coil itself gives rise to large mechanical 
forces acting on the various portions of the winding. 

By p. 305, the magnetic moment per unit area is increased by a 
factor p, if a medium having a permeability of this amount replaces 
the air. In this case formulae (30) and (31) must be multiplied by /.a, 
and we have for the force 

W — - [zffjdr, or =ju/u, 0 [fH] dr. . , (ST) 

Although in computing the force we may replace a circuit by a mag- 
netic shell, we must avoid setting the potential energy used here equal 
to the field energy. The reason is that something new appears when 
the conductor is moved through the field, viz. am induced ds<trn- 
motive faros (see next chapter) whose effect is to weaken the current. 
In order to maintain the former current, energy must he supplied. 
The net effect is that for a fixed, current the total field energy is found 
to increase by exactly the amount by which the magnetostatic poten- 
tial energy would — according to the above calculation — decrease. 

Ex, 87, The Ballistic Galvanometer,— A current of very abort duration is 
sent through the coil cf a suspended- coil galvanometer at the time f =*= 0, the 
total quantity of electricity which flows being e. What is the maximum displace- 
ment? (Take the time integral of the for ce, and compute the amplitude of the 
vibration from the initial velocity thus found,) 



CHAPTER XVI 

Slowly Varying (Quasi-^tationary) Fislbs 
Definition of quasi-stationary fields 

All electric and magnetic fields so far considered have been inde- 
pendent of tlxe time. We now go a step further and consider fields which 
vary with the time, but restrict ourselves for the present to fields 
varying relatively slowly. These are of great importance. We shall 
find later that electrical fields are propagated with the velocity of 
light. If, then, we restrict ourselves to not too rapid changes, and to 
systems whose dimensions are not too great, we may assume that a 
, state corresponding to instantaneous propagation holds at every point 
of the field. The relationships derived on this basis hold raider all 
conditions for the range of alternating current frequencies in technical 
use (10 to 1000 cycles per second). 

1. The Law of Induction. Maxwell’s Equations. 

Imagine a closed linear conductor in a magnetic, field. Let- the 
surrounding medium have a permeability a. We know feem elementary 
experimental physics that if the field EE changes, a current is “ induced ” 
in the conductor. This means that there must be an electric field E 
within the conductor. Since we assume the conductor to be linear, 
E has the direction of the linear elements ds. If we compute the 
integral <p aids, say in the direction of E, this integral can no longer 
vanish, since all the contributions to the sum are of one sign. We 
thus encounter a rotational electric field for the first time. The value 
of j) Eds is called the induced electromotive force, for the following 
reason. If we imagine the conductor severed between two neighbour- 
ing points Pj and P g , the work which must be done to move unit 
charge from P a to P t against the field is 

—W = f Eds, 

since the contribution of the open portion P^Pg is vanishingly small. 
Hence there is a difference of electrical potential * of this amount be- 
tween P, and P 2 . The Law of Induction states that the induced elec- 

* This may manifest itself, for example, in the form of a spark passing between 
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tromotive force is equal to 1/e times the decrease per second of the 
number of lines, of magnetic induction enclosed by the circuit.* The 
usual orientation of surface normal and direction of describing the 
circuit is assumed. The mathematical formulation of the Law of 
Induction is then 

f Eds = — ~ funds (1) 

In the M.K.S. form there is a quantity on the left having the dimen- 
sion volts ; or. the right, amp. /sec. Hence there appears a new dimen- 
sional constant, called the permeability of free space, whose numerical 
value is 

/i 8 = 1-256 x 10~ 6 volt sec. /'amp. 

The M.K.S. form of the law is thus 

j> Eds =» — ~ yVw'HdS. .... (la) 

The line integral J Erf*' may be changed to a surface integral by using 
Stokes’s theorem: 

<£ Eds = J cur! EdS = — £ ~ j ulidS, . . ’ (2) . 

For stationary objects, integration with respect to position and dif- 
ferentiation with respect to time are independent, so that we may write 

f curl EdS = - ; f | f (/xH) dS. .... (3) 

Since this equation must be satisfied for any surface, the integrands 
of the surface integrals must be equal, and we obtain the first triple 
set of Maxwell’s Equations of the electromagnetic field: 

, _ a 3H 1 3B 

curl E = ~ - — - — - (3a) 


In the M.K.S. system, 

curlE= -wt, 3 ®. ..... (36) 

The Maxwellian theory assumes that this induced electric field is 
present even if not detectable in the form of & potential difference 
between the ends of a material conductor; in other words, this set 

* This mates- ei-i, sometimes referred to as Neumann's Law, is a fast, of experience. 
The part giving direction of the induced eleofcrotnotivu force is an example of what 
is known as tanre Law. 
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of equations is generally valid for expressing the magnitude of the 
electric field induced by a changing magnetic field. 

Ex. 88. Compute the current induced in the coil of a moving-coil galvano- 
meter as it swings, and find the damping of the motion of the cod produced by 
this current. 


2. Self and Mutual Induction. 

(a) Calculation of the coefficient of mutual induction 
Let there be given two circuits 1 and 2. If a current flows in circuit 
1, it generates a magnetic field H t whose lines of force pass through 
circuit 2. If the current 7, in circuit 1 changes, an electromotive force 
is induced in circuit 2. Its magnitude is given by 

— — fjuiS, (4) 


If the whole of space is filled with a medium of uniform permeability 
H, then according to Chap. XV, § 2 (p. 302), Hi is derivable from a 
vector potential: » 


= curl V x , Vl = - — h&dffl. 

cJ r„ c J n* 


Hence, by Stokes’s Theorem, 

J H^S. — f (curiVJdSj = £ V l( 7s 2 = ^ (6) 

Le. the electromotive force induced in circuit 2 becomes 


F 2 = j) Eadfij 


ft dL r rds x ds t 

5 Tiff -£r*- 


. . ( 6 ) 


The quantity or ^ j) M.X.S. units (henry) 

which depends only upon the geometric relationships of the two cir- 
cuits and upon the permeability of the intervening medium is called 
the coefficient of mutual induction, or the mutual inductance L }i . With 
this notation (6) may be written 

F a = ^E ^= — 7^^ (6') 


Since both circuits enter into the expression for L n symmetrically, 
we obtain the same expression if we calculate the electromotive force 
induced in circuit 1 by a change in the current flowing in circuit 2, i.e 

• ( 7 ) 
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(b) Self-induction 

Even if there is but one circuit, a change in current causes a change 
in the magnetic flux through the circuit which, by the Law of Induction, 
results in an induced electromotive force. In the same way as we 
defined the coefficient of mutual induction above, we define the 
coefficient of self-induction, or the self -inductance to be a certain positive 
magnitude depending only upon the geometric form of the circuit 
and the permeability of the surrounding medium. The defining 
equation is 

§Uds = -L n d ± . . ( 8 ) 

Here, too, we must use the negative sign, since the direction of the 
induced electrical field is such as to oppose the original current (Lens’s 
Law). Were it the opposite way, the original current would be in- 
creased by the self-induction, hence a still greater electromotive force 
would be induced, and so on without limit. 

If we attempt the calculation of the self-inductance of a circuit 
in a way similar to the above computation of mutual inductance — i.e. 
by calculating the field at ds caused by all the other current elements, 
then computing the total induced electromotive force by integration 
over all conducting elements — the resulting formulae are found to have 
only limited application. In calculating mutual inductance we are 
permitted to assume that the conductors are linear, since the separation 
of the two circuits is taken to be large compared with their cross- 
sections. For the case of self-inductance, however, this is no longer 
the case, and we must divide the current into individual filaments 
and take into account their mutual influence. Thus the calcula- 
tion of self-inductance is generally more troublesome than that 
of mutual inductance, so that we shall omit the detailed treatment 
here. 

In the case of coils, we are mainly concerned with the magnetic 
field at the position of a selected turn caused by the current in the 
remaining turns; hence a division of the current into filaments is not 
required. In this instance the coefficient of self-induction may be 
calculated simply and to a good approximation by another method. 
We start again with a ring-magnet of N turns, the cross-section S 
being small compared with the radius a of the ring. The field inside 
the windings is [equation (26) (p. 307)] 

s _ InNI _ 4mNl 
2nac ~ In 

If the strength of the current changes, the total magnetic flux 
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through the N turns will also change, and this change, per second, is * 
d r , d<Sf , ro dH A7 .„47tW dl 

3 J l ‘* d> ~ M s . 

so that the induced electromotive force is 

The self-inductance of the ring-shaped coil is therefore 

. . (9) 

This result holds approximately for a very long unclosed coil also. 
The approximation consists in neglecting the disturbance of the uni- 
form field caused by the ends of the coil. 

(c) Transformation of the expression for the magnetic field energy 
The energy of a system of current-bearing conductors is mainly 
that of the magnetic fields caused by the several currents. Since the 
electric fields present in such systems are small, the electrostatic energy 
is relatively negligible. With the help of the concepts of self and 
mutual induction, the expression for the magnetic energy may be 
brought into a form which is very useful for many purposes. Accord- 
ing to equation (7) (p. 300), 

V m — ^ J pti? dr, (10) 

We assume that space is filled with a uniform medium of permeability 
p, although the final expression obtained will be valid even if the 
permeability varies from place to place. pH is non-divergent every- 
where, but H is non-divergent only in a homogeneous medium. We 
may thus write 

1 H = curl V. 

Now by equation (92) (p. 38), H (curl V) = V (curl H) + div [VH], 
Substituting this expression in equation (10), we can transform the 
second integral into a surface integral over an infinite sphere. - How- 
ever, by p. 302 we see that V decreases as 1 jr, so that curl V decreases 
as 1/r 8 , and hence this integral vanishes. As a result, we have 

Um== &rf V (CUTl H)< * T = £ / VidT ‘ * * (U > 

* Here, and in the formulae following, integration implies summation over all N 
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If we have only linear conductors, which, of course, mast be closed 
if currents are to flow at all, (11) yields 

..... ( 11 ') 


where /* is the current in the Ath conductor. If we now decompose 
the vector potential at the position of circuit k into the parts V v V a , . , . 
contributed by the various conductors (including circuit k), then we 
have by equation (15) (p. 300) and. the definitions of the coefficients 
of induction, 

<£ VfdSf. — - L it I, and <£ V k ds k = - L ik /*, (12) 

J S* « a 

Hence we have 
tf m * g = MW + 2L n IJ a 

+ . . . 2L ln I 1 I n + L 22 l* + 2L 23 / 2 7 s -f . . . i nn /„ 2 ). (13) 

Ex. 89. A charge — e revolves about a fixed charge +e in a circle of radius 
a. How much energy is absorbed by this system when a magnetic field H is 
applied, the angle between the field and the normal to the orbit being 0 f 
(To simplify the calculation, assume that |^y | = const.) 

Ex. 90. (o) Calculate the mutual inductance of two coaxial circles of wire, 
radii a ; and a t , whose centres are at a distance * apart. 

(b) Show that for two conductors L u L 2i > always. 


3. Stationary Alternating Current Circuits. 

With the help of complex numbers we can derive relationships for 
the stationary state of uniform periodicity attained by circuits to 
which periodically alternating potentials are applied. The steady 
state referred to is that arrived at some time after the initial applica- 
tion of the electromotive force. These relationships permit us to 
extend formally to alternating current circuits the laws of Ohm and 
of Kirchhoff derived above for direct currents. 

Let us consider a circuit consisting of a source rf periodically 
alternating electromotive force V — say a dynamo — connected across 
a conductor of ohmic (steady current) resistance R and self-inductance 
L. In addition to the applied E.M.F., there is also that due to the self- 
inductfon of the circuit, so that 

«> 

If R may be neglected, we have 


(14') 
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Assuming the phenomenon to be strictly sinusoidal, 
V — F 0 e‘“ ( , I — / u e‘“' 1 . . . 

dl 
dt = 


Hence 


= uoZ 0 e i, " ( = Uol, 


and so 


V = 


( 15 ) 


(16) 


Comparing this expression with Ohm’s law, we see that a formal 
correspondence is obtained by setting the alternating-current resis- 
tance caused by self-induction — the so-called reactance of the circuit — 
equal to iosL. What is the physical meaning of this imaginary, resis- 
tance of self-induction? It has been found of great advantage to re- 
present alternating current phenomena by means of the vector 
diagram {cf. p. 52). If we place the “ vector ” of the current along 
the positive real axis of such a diagram, (16) shows that the vector 
for the electromotive force lies in the positive imaginary axis. That 
is, in a circuit containing only an alternator and a pure inductance, 
the electromotive force leads the current by a phase angle w/2. For 
a given applied potential, the strength (amplitude) of the current 
is given by (16) as 

< 17 > 

Suppose, now, that the inductive portion of the circuit is replaced 
by a capacity (condenser) 0. If the applied E.M.F. were constant, 
then after the initial charging current no current would flow, but on 
account of the periodic charging and discharging of the condenser, 
an alternating current results. Here again we obtain an imaginary 
resistance due to the presence of the condenser. Since the current I 
is the quantity of electricity flowing through a cross-section of the 
conductor in unit time, the charge on the condenser is given by the 
time integral^ Idt, if we start to count time at the instant when the 
condenser has zero charge. Hence the potential difference of the two 
sides of the condenser which, neglecting the resistance of 'the con- 
nexions, is equal to the electromotive force of the generator, has the 
value 


so to 


(18) 


re, since dVjdt ~ iaiV, 
...... (19) 

Hence the alternating current resistance offered by a, capacity C is 


For a purely periodic applied potential v 

y — _L_ 
iwC' 
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givers, by I fituO. In this case, since i in the denominator is equivalent 
to —i in the numerator, the applied electromotive force lags Itehind 
the current- by a phase angle w/2. ; 

By using these virtual resistances of seif- T j 

induction and capacity we can treat any ' j 

.alternating current circuit like a direct; cor- °" / t 

rent circuit. After completing the numerical | L-t? j. UK 

compactions, it is only necessary to draw Jjl. | zi 
the index diagram, for current and electro- .£_j 
motive force; this gives the amplitude and Fig , , 

phase relationships between these quantities. 

As an example, Jet us take a circuit containing ohmic resistance, 
self-induction and rapacity connected to an alternator. By Kiich- 
liofFa first law (p. 21M), 

v - F, + F* + F a - i(r + ml + . . m 

If we take I along the positive real axis in the index diagram, Y t is 
along the positive imaginary axis and F ? along the negative imaginary 
axis. According to fig. 1, the resultant electromotive force is of: mag 
nitude 


- Jr)’- 


This electromotive force leads the current by an angle <£, t 
have V = P u e‘ K ‘, and I = £>. where 


For an applied electromotive force of given amplitude F 0 , the ampli- 
tude of the current is 


.... ( 23 ) 

Taking only the amplitude into consideration, the system las an 
apparent resistance of magnitude 

'This quantity is called the impedance of the circuit. It has a minimum 
value when 

• “’-ra ia4! 
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In this case there is no difference of phase between V and /. This 
condition corresponds to resonance, and we shall find, in the next 
section, that tho frequency of the applied potential in this case is the 
same as the natural frequency of the circuit. 

In order to calculate the power, i.e. the amount of electrical work 
done per second, it is convenient to turn to the real part of the equations 
and write 

F— F 0 cos cot, I~ 1$ cob (cot — </>}. , . . (25) 
The power W, at a given instant t is 

w t = FI = F C I 0 cos«*f cos(nrf - <f>). , . . (26) 

It is more important to know the time average of the power, W. We 
need only calculate tho average over one period. Thus 

cos tot cos (cot — (cos ad cos cut cos^+ cosa>< sin ait sin <$) dt 


= | coa$ , (27) 

Hence the average power is 

W = I7 0 l it cosf ...... (28) 


The expression cos ^ is known as the power factor of the circuit. .If the 
impedance is purely inductive and capacitive (R— 0), <f> = ± v/2, and 
no power is consumed in the circuit. In this case the current is said 
to be wattless. This shows why it is better to control alternating cur- 
rents by choke coils (inductances) rather than by ohmic resistances. 

The analogy between equation (28) and the expression for mechanical 
work (foroe into distance into cosine of the included angle), which 
becomes especially apparent with the use of the vector diagram, has 
no deeper significance and is of mnemonic value only. 

Ex. 91. Calculate the impedance of an inductance and a capacity connected 
in parallel. 

Ex. 92. Show that an oscillating circuit containing an iron-cored coil m 
seif -inductance is a non-harmonic system to which the results at p. 101 may be 
applied. 

4. Noa-statiouary States (Transient Phenomena) in Alternating Current 
Circuits. 

The relationships between current and electromotive force in 
alternating current circuits which we have derived by means of the 
vector diagram refer to the steady state which sets in after some time. 
The procedisre of § 8 is not applicable to the transient conditions 
immediately following the completion of the circuit, for these phe- 
nomena depend upon the initial conditions. Since the results obtained 
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above place no constants of integration at our disposal, we have no 
way of satisfying the initial conditions. Mathematically, the solntions 
corresponding to the steady state are particular integrals of the dif- 
ferential equation, and we have now to find the general solution. 
We shall treat four different systems, beginning with the simplest and 
proceeding to the more complicated. 

(a) Free oscillations of a circuit containing only inductance 
and capacity 

Assume that the two plates of a charged condenser are joined by a 
conductor of negligible resistance. We must insert the expressions 
for the potentials due to self-induction and capacity (§ 3), in equation 
(20), p. 318, in the form of derivatives and integrals: 

L §+hf Idt==0 • < 29) 


Differentiation yields the familiar equation of simple harmonic 
vibrations 




r d 2 I .It n 

L W + C I= * 0 ’ 

(30) 

whose integral is 


I — ae^* 2 -f 6e~*"* 2 

(31) 

where 


"■ VLG 

(32) 

Using real functions, (31) 

may be written 


I = A cos 

ia»o« + 

B sinco^ or 7 = 7 0 cos [of — <j>). 

(31') 


(6) Free oscillations of a circuit containing inductance, capacity 
and resistance. . Damping 

Here we have a term R1 to be added to the left member of (29). 
Differentiating as above, we find 


This equation, too, is known from mechanics [Chap. V, § 8 (p. 95)]. 
The real form of the general solution obtained there iB, for 


1 

w > m- 


I — Ae~ Rll2L cos (a> 0 't — <f>). 


. . (34) 
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The amplitude of the vibration decreases exponentially as the time 
increases, and the vibration is therefore said to be damped. The fre- 
quency of the characteristic oscillation of the damped system, ai 0 ', 
is smaller than that of the undamped circuit: 

, _ /I # ^ 1 

w ° _ V LC 4L 2<< ° 0- vTO ' ■ ’ 35 


The ratio of two successive amplitudes, corresponding to one period, 
is constant, and is called the decrement. One usually deals with the 
natural logarithm of this quantity — the logarithmic decrement 8 
(cf. p. 96): 


S = log 


e __ R rp_ R 

e -g(t+T) 2 L 2 L 


2 tr_ 
W 0 


. . (36) 


where T is the period. If the damping is not too great we may here 
use the frequency of the undamped vibration in place of the actual 
frequency of the damped system, in which case the logarithmic decre- 
ment becomes 

S (37) 

On the other hand, if ~ 

jLC 4=1/ 

there is no imaginary part, and the solution is a purely exponential 
decreasing function 

I = ' + Be~(k~^G F^) * , . (38) 

an aperiodic system. The limiting case 1/LC = jK 2 /4L 2 is also aperiodic; 
it corresponds to the mechanical situation treated on p. 95. 


(c) Damped circuit with periodic applied electromotive force 
The differential equation is 

L d ~ + RI + ± f Idt = F 0 e'" £ . . . . (39) 

Differentiating with respect to the time, 

L W + R % +g* = ^V“‘- . . . (40) 

According to the theorem on p. 92, the general solution of this non- 
homogeneous equation is given by the sum of the general solution of 
the corresponding homogeneous equation, i.e. equation (33) above, 


(B 711) 
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and a particular integral of the non-homogeneous equation. But we 
already know a particular integral, viz. that corresponding to the 
steady state, which was derived in § S (p. 318). For 
JL 

LC iL* 

the complete general solution is 

(A cos cofi + B sin w 0 ’t) + , (41) 

V'+^-jz) 

where tan<£ = w ™~. 


The part of this equation containing the exponential factor represents 
an oscillation having the angular frequency <v 0 ' of the characteristic 
oscillation of the free damped circuit; the other term is an undamped 
periodic function having the angular frequency <» of the applied 
electromotive force. At the beginning, the resultant vibration is a com- 
plicated oscillation, which, however, soon, attains the steady state 
given by the last terra, the transient part being negligible after & 
short time. 

(d) Inductively coupled free circuits 
Of the many possible arrangements of coupled circuits which are 
of technical importance we select the simplest case for treatment here. 
Let us consider two circuits which are inductively coupled. Let the 
self-inductances be L u and L n respectively, and let the mutual induc- 
tance be Z^ 2 . Here, again, we shall carry the calculations only far enough 
to reveal the new principles involved in this type of coupled system. 

The differentia! equation fcr the first circuit becomes after dif- 
ferentiating once with respect to the time: 

(42) 


t of the second circuit becomes 
, ffiU ■ r 


■ C ' 


m 


In order to eliminate Z 3 , we differentiate equation (43) twice with 
respect to i, obtaining ^ 

, d% , r d% , ‘ 1 d?I 2 
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But from (42), § - - l (i» ® + £ 7,), . . 

. (45) 

and after differentiating twice, 

dt* L n \ 11 d* T Cj (fe 2 / 

. (46) 

Substituting these values in equation (44), and using the abbreviations 

— o-A-, 

• («) 

we obtain the fourth-order differential equation 

d*L rd%. T n ‘ 

a W j ' b W + eIl ~°' ‘ * * 

. (48) 

We assume a solution of the form 

Jj =a AeF- ' 

. (49) 

and obtain the characteristic equation 

oA 4 5A 2 + c — 0 

- (50) 

Of the four roots, two are essentially different: 


a • * 

• (51) 

.lb [¥ 3 . 

• (52) 


The other two roots differ from these only in sign. According to 
Ex. 90 (p. 310), a is always positive, and b and o are essentially positive, 
while 6 s - 4 ac is a sum of squares. Hence, since 4= 0, we always 
have 


That is, the two frequencies <a x and to 2 are always real. By elimina- 
ting d 2 Iv/dt* between (42) and (43) we see that there exist two simul- 
taneous frequencies nq and u> 2 for / a as well as for / x . Neither of these 
frequencies* coincides with the natural frequencies of the separate 
circuits. The solution for I x thus ha3 the form 

I t — Ae 1 *' 1 -f Be - *”'* f + De-™'*. . . (54) 

The co-existeace of two vibrations of different frequency implies the 
occurrence of beats, i.e. the appearance of slow variations of amplitude 
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as may be seen at once from the following trigonometric identity: 

t cos 

The right member represents a vibration of angular frequency 
(ou 1 -f- oj 2 )/2 whose amplitude is “ modulated ” (cf. p. 57) with an 
angular frequency (co 1 — oj 2 )/2. Since the circuits are assumed to be 
undamped, the energy of circuit 1 cannot be destroyed when its 
amplitude is reduced to zero, but must be transmitted to the other 
circuit. The beats occurring in the two circuits must therefore have 
a phase difference of tt/2, so that minimum amplitude in the first 
circuit corresponds to maximum amplitude in the second, and the 
sum of the energies (squares of the amplitudes) remains constant. 
This is actually confirmed by the complete calculation, which will 
not be gone into here. 

5. Resistance and Inductance of Wires for Alternating Currents. Skin 
Effect. 

All low-frequency electromagnetic phenomena in homogeneous 
media are determined by the Law of Induction (p. S12), together 
with equation (9) (p. 301 ), which was derived for steady currents, and 
expresses the connexion between current and magnetic field. If several 
different media are present, the boundary conditions of the continuity 
of the tangential components of E and H must also be taken into 
account. In view of the meaning of the curl, the above two equations 
really represent a system of six first-order partial differential equa- 
tions for the six components of E and H. Of the enormous number 
of solutions of such a system, we must select that which corresponds 
to the problem under consideration. This selection, and not the cal- 
culation of the integrals, is the difficult matter. We consider the 
following very simple case. Let there be given a very long, straight 
wire of finite cross-section to whose ends an alternating difference of 
potential is applied. For direct current, the current density would 
be the same at all points within the wire, provided the conductor is 
made of homogeneous material. For alternating current, the current 
density is greater near the surface of the wire, on account of inductive 
effects. When the frequency is very great, the current is almost entirely 
confined to th& outer layers. This phenomenon is known as the shin 
effect. 

The dependence of the current density upon the distance p from 
the axis may be calculated with little difficulty from the two differen- 
tial equations: 

curl H = —; curl E ==- curl t . . (55) 

a or c 8t x ' 


cosco^ + coscogt = 2 cos 


( <*>! — COg 
\ 2 , 


( 4 31 )'■ 
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H may fee eliminated as follows. Take the curl of the second equation 
and differentiate the first with respect to t. Since we are dealing with 
bodies at rest, the derivatives with respect to the co-ordinates (in 
taking the curl) and those with respect to the time are entirely inde- 
pendent. Hence 

S- curl H = curl ~ . 
at at 

If we eliminate this term from the two equations obtained, we find 
curl curl? = grad divt — A* == — 

Since divt = 0, this equation reduces to 

' M-ha” (56) 


On account of symmetry, the vector i in the infinitely long wire has 
the direction of the axis of the wire. We take this to be the a-axis. 
Then we may write 

i = u[p, t)k. . (67) 

We thus obtain a scalar equation which has the following form in 
cylindrical co-ordinates: 

3 2 m 1 du inor/x du 

s? + ' P r P =-*' Tt (58) 

We are looking for a solution u which is a periodic function of the 

time, and so we put 

« = /(/>)«’“* • (59) 

and obtain the following ordinary differential equation for f: 

o. .... (so) 

p dp c 2 

In this equation we set 


and introduce a new variable x = ap. Then 
dp dx 

-L.I d l 


t this is the differential equation for the Bessel functions of order 
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zero* The only one of these functions of use here — the one continuous 
at the origin — is denoted by J 0 (x). Hence 

« — AJ 0 (q.p)eh t — AJ 0 ^ V — i VATrapco , . (64) 

where A is the amplitude factor. 

The real part and the imaginary part of the function J 0 {x\/—i) 
occur in various problems and are tabulated a! new functions of x. 
They are known as “ ber (x) ” and “ bei (») ” respectively: 

(#e)[J 0 x\/--iJ -- berx j 
(/m)[J 0 *V --»] — beix J 

Using these functions, our solution may be written 

u= A | ber VAvopco ~ ^ + i bei (\/ 4 iropco ~ ^ j 

or u~ A Artapu) bei 2 (^\/ 4 rtapev 
bei 

where tan$ = — 

ber 

That is, not only the amplitude, but also the phase depends upon the 
radius. For large values of p (iron 
wire) both effects can be shown 
even for currents of very low fre- 
quency (1 oycle/sec.). One can 
obtain a picture of the. process 
by drawing the vector for the 
current density corresponding so 
various values of p (fig. 2). In 
order to determine the amplitude factor A we have to integrate 
* over the cross-section, which gives the total current as a function 
of A, so that A in turn may be evaluated in terms of 1. 



(v irrap/jj 


(65) 

( 6 «) 

( 66 ’ 



CHAPTER XVII 


Rapidly Alternating Electromagnetic Fields: 

I Propagation in Homogeneous Isotropic Media 

I. The Electrical Analogue of the Law of Induction. 

The La w of Induction states that, in a vacuum, any change of the 
magnetic flax through a surface bounded by a curve results in the 
formation of an electrical “ vortex ”, i.e. a finite value of the iia-.s 
integral £ E<1$. The reciprocal relationships between electrical and 
magnetic phenomena already encountered many tames would lead 
us to expect that a change in electrical flux similarly would cause a 
magnetic “ vortex ”, Thus far we have encountered magnetic curl 
only when conduction currents were present. We write m a tentative 
form of relationship, for a vacuum, 

1 3E 3S ,,, 

curJH -- - or lM-K.SriJL.c-. , * * (1.) 

6 at ' '• dt ’ 

This equation was derived by Maxwell from the notion that the volume 
elements of the “ ether ms polarized when p. field is applied, just as 
are those of a material dielectric (p. 279). It may be seen readily that 
electric charges are displaced when a volume element is polarised, 
and hence that a current flows while the polarization is being brought 
about. 

Imagine a small cylinder of unit cross-section and altitude dh in 
which a polarization P is generated parallel to the axis. The apparent 
charges on the transverse surfaces of the cylinder amount to cr + =» -j~ | P j 
on the upper side and o.„ ~ • j P j on the lower, according to p. 282 . 
If P changes by dP, the density of charge is altered by -j- } dg j and 
— | dS j respectively. If this charge is to reach the bounding surface 
it must pass through the cross-section immediately below or above. 
Hence a positive current dP/rft flows in the upper half and a negative 
current — dPjdt flows in the lower half, but in the opposite direction, 
sc that a “ displacement current ” of 

1 dP 
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electromagnetic units flows during the time the polarization is chang- 
ing. But Maxwell’s concept of a polarizable ether built up of electric 
charges is no longer tenable, and we are justified in assuming the 
existence of a true displacement current only in the case of material 
dielectrics. Hence* we have thus far no rigorous basis for the term 
1 

c dt’ 


especially with regard to the sign. We shall therefore justify it for the 
present only by noting that all its consequences are confirmed by ex- 
perience. Later on we shall see how the field equations may be de- 
duced by a method which is based on very few hypotheses. Hence in 
■ a conducting dielectric medium we have three contributions to the 
current producing a magnetic field, so that we may write in general 


4w« , 13E . 4tt8F ini , 1 dE , 4w 0E ... 

ou I lH---+ o -j i + TTi =.- T + J Tl + T « W (3) 


The last two terms are usually combined into — ^ , so that we write 

curlH== f W + 7 aEi ■ • * • • (2a > 

or, in M.K.S. units, curlH = KK 0 ^ + oE (26) 

Following Maxwell’s ideas, we refer to the term 
KdE 
c dt 

as the displacement current, to distinguish it from the actual conduc- 
tion current. The above equation (2a), together with equation (3a) 
(p. 312) [or (26) and (36)], constitute Maxwell’s equations for the electro- 
magnetic field. 


2. The Wave Equation for the Propagation of Fields in Dielectrics. 
For isotropic dielectrics at rest the following equations hold; 


Kd E 
= c dt C 




• (I) 

. (n) 


If, as we shall assume, no volume charges are present, 


have in 
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addition the equations of continuity of the electrical and magnetic 
fields, viz. 

divE = 0 and divH = 0. . . . . (HI) 

Either E or K may be eliminated from the Maxwell equations by the 
method of p. 325 . If we ohoose to eliminate H, we take the curl of 
the second equation, while we differentiate the first with respect to t. 
Using (III) we then obtain 

. Kp 3 a E „ v 3 2 E 

g-g , or AE = KK 0 pp 0 . . . (3) 


This is the differential equation of a wave-like propagation of the 
components of E. By eliminating E, we obtain an exactly similar 
equation for H. Of the tremendous manifold of phenomena described 
by this differential equation we now consider the simplest case — that 
of a plane wave. We understand this to mean a wave for which all 
quantities depend only upon one co-ordinate, say x. The components 
of E have constant values in planes normal to the x-axis. Mathemati- 
cally, this means the vanishing of all derivatives d/dy and 3/3z. Hence 
from (III), 

3 B m , 32?, , 3 E,_ 


divE = 


it follows that 


"3aT + ~dy 
3 E„ 


+ ^ = °» 


Hence the electric field can have no component in the direction of the 
x-axis. Since we are dealing with a wave motion, a superimposed field 
which has the same value at all points does not interest us here; we 
may conclude that equation (4) implies that E x — 0. In the same 
way we find that H x = 0. Hence the vanishing of the divergence of E 
and H implies the transversalUy of the plane watte. 

We obtain the equations 


3 2 E y _ Kp 3 2 E V , 3 2 E % _ Kp 3 2 E t 
3x 2 ~ c 2 3 t 2 3x® ” V 3 1 2 ' 


• ( 5 ) 


for the components E v and E,. We assume a simple periodic depen- 
dence on the time: 


E v = /(x)e*“‘ and E t = ( 7 (x)e i( “ , ~ 4) , ... (6) 


where the phase constant S is included to ensure generality, 
by (5) 


d 2 / , a > 2 Kp 


'/= 0 and 




Then, 


( 7 ) 
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The integrals of these well-known equations of simple harmonic vibra- 
tions are /= Ae±‘°‘ vy &*l‘ and g = Be*™ v-X?*. . . (8) 

Using the negative sign, we then have 

E v — Ae‘ u(t ' '/&*!*) and E, — J^**-* / Xm*/*>-<*. . (9) 

This is the equation of a plane wave propagated in the direction of 
the positive a-axis {the negative sign would signify the opposite direc- 
tion) with a wave velocity 

v — cjVKjl, or v = l/VKKfffjjXfj, . . „ (10) 

In the CALS. system, v — c for a vacuum, so that the factor c introduced 
on p. 300 has a highly significant physical meaning. It represents the 
speed of electromagnetic waves in a vacuum, as anticipated there In the 
M.K.S, system c = lj\ /r K 0 iJ. a , as is evident from the dimensions of 2? 0 p 0 . 

E y and E % are independent of each other, so that the phase difference 
is arbitrary. The most general form is thus the eiliptically polarized 
wave in which the end point of the vector E describes an ellipse at any 
given place; if this motion is confined to a straight line we have a 
plane polarized wave. 

In order to calculate the components oj H we are net permitted to 
start with the wave equation for this quantity; this would bullish us 
with an excess of constants. We must observe that E and H are con- 
nected by the field equations. For E t — 0, the second one gives 

d JLi „ o, ! - a iutAtH* (11) 

at c at ox c 

Integrating, 

i?,= 0, H, -~ v «/«> * /K Ey. . (12) 

V « V M 

Since we are no more interested in a temporally constant field than 
in a spatially constant one, we may set the integration constants of 
(12), which are independent of. timo (functions of x) equal to zero. 
Thus we have + s t H, — 0, 

so that E is perpendicular to H in a plane polarized wave. Let 
us designate the direction of propagation by the unit- vector n. 
Then E, H and n form a right-handed orthogonal triad, in this order. 

Electromagnetic waves embrace an enormous range of physical 
phenomena, the essential unity of which remained um-icoguizod for 
many years, chiefly because the methods of production and the effects 
of the waves differ widely according to the value of the frequency v 
(or the vacuum wave-length A =e c/v), as the accompanying short 
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table (fig. I) shows. The entire range, with certain overlappings of the 
methods of generation and of investigation, is now known without 
a break. 

In optics, the ratio 
of the phase velocity c 
in vacuum to that in 
any other medium is 
known as the index of 
refraction a of the 
medium. Thus, by 
equation (10), 

J (13) 

This equation is known 
as Maxwell’s relation. 
But, as even the familiar 
example of water {K = 
80, p tv 1, n = 1-33) 
shows, this relation is 
not always satisfied if 
we take n to be the 
optical index of refrac- 
tion, If (13) were satis- 
fied down to arbitrarily 
short waves there could 
be no dispersion at all, 
i.e. no variation of n 
with wave-length, if 
K and p are not depen- 
dent upon frequency. 
The discrepancy is explained by the fact that the polarization of a 
medium (which determines the value of K) becomes dependent upon 
the frequency for rapidly alternating fields. Down to wave-lengths of 
about 1 cm. the static value of the dielectric constant may be used 
for all substances, • For shorter waves (higher frequencies) dependence 
on frequency comes in. This -will be explained in the second part of 
the Theory of Electricity (p. 452). 

3. The Poynttng Vector of Energ- Flow. 

The energy density in the electromagnetic field has the value 
« — ^ (K& -f /uH 2 ), or u — i(KK 0 E 2 + pp 0 H 2 ). . (14) 

Since by' (12), ,| H | /?. | E j or = i E i 

V f 4 \ P-Po 


i' hibU Rejwn, _ 
IJUrn -Hotel - 


X( cm) 

\-m t 

-to* WtreUxs Hfam 


Short Electric Mfoes 

i— infra-red Bg-vapaw Emission, 
— H-/0* JtesiduaL Roys 

l0 * "T Nm." Infra-red 
-ter* _L, visible Region, 
j i ziftfltm-Holri Region, 

'i fT LymeoiMtllikan Region. 


-^1 
Trays I-** 


X-rays 

y-rays 
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for a plane wave, /£E 2 — pH 2 , or KKqE 3 = /r/t 0 H a . , , . (15) 

Tims tbe energy is half electrical and half magnetic in this case. Now 
we know that energy is transported by every electromagnetic wave. 
In order to verify this experimentally we need only convert the energy 
into heat at a 'given point and determine calorimetrically the rate at 
which energy is being delivered. If the wave advances with a velocity 
v, th,e amount of energy streaming through unit area of a surface 
normal to the direction of propagation in the time dt is that residing 
in the space within a rectangular parallelepiped of 1 cm. 2 base and 
altitude vdt, i.e. 

+ (16) 

We may therefore represent the energy flow by a vector in the direc- 
tion of propagation n whose magnitude is equal to the energy flow 
per unit time. This is the Poynting Vector 

S = — [EH], or 8*** = [EH], . . . (17) 

so named after its discoverer. Since E and H are periodic functions 
of the time, the vector S at a given place will also experience 
periodic fluctuations between zero and a maximum value. A 
change of sign never occurs, however, since E and H are always 
in phase [cf. (12)]., The Poynting vector is always in the direction 
of propagation. 

The Poynting vector S has a much more general significance than might be 
expected from the way it was introduced. We never come into conflict with 
experience by assuming that this veotor may be applied in all cases. It holds 
for all simultaneous electric and magnetic fields — even for steady fields. For 
example,, consider a cylindrical wire bent into the form of a circle, the ends being 
joined to a source of direot electromotive force. Within the wire, the electric 
field is along the axis, while the magnetic lines of force are circles whose planes 
are normal to the wire. Hence the energy flux vector at the surface of the con- 
ductor is normal to the surface and, moreover, is directed inwards no matter 
which way the current flows. Thus there is a continuous flow of energy from the 
surrounding dielectric into the wire, and it is readily shown that the amount of 
energy entering unit length of wire each second is exactly equal to the Joule 
heat developed in that portion of the wire per unit time. The ultimate source 
of energy is the battery. From the atomic standpoint, however, this interpretation 
sounds artificial. 

Ex. 93. The amount of solar energy received by the earth is about 2 cal./ 
min. /cm. 2 . What are the amplitudes of the electrio and. magnetic fields of the 
radiation ? 
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4. Propagation of Electromagnetic Waves in Conducting Media. 

For media in which there exists a non-vanishing conductivity the 


field equations are 

-—7 "+?* ® 

curlE — e I?. (II) 

divE=0, divH— 0 (Ill) 


If H is again eliminated by the method of § 2 (p. 329), we have the 
following equation for E: 

. Ka 9 2 E . 47rcru 3E 

AE = ^ + -/a7' 


j-r ■fp’ 3 2 E 0E 

AE = KK 0 ^ + ° 


(18) 


We again consider plane waves propagated in the direction of the 
positive x-axis, and set 

3 . d 




= 0, 


from which it follows that E a — H x = 0. Assume again that E„ and 
E t are simple periodic functions of the time: 

E v = /(x)e“‘ and E t = g(x)eh*. . . . (19) 


For simplicity of notation, we restrict our discussion to plane polarized 
waves whose electric vector is in the xy-plane; hence we put E, — 0, 
By substitution in the wave equation we obtain the following dif- 
ferential equation for the point function /: 

g + (W. f ^ /=0 , . . . (20) 

the solution of which is 

/(as) « A±i v KTO^»/ { (21) 

If a = 0, we again have the case of an undamped wave. In this 
stance we must choose the negative sign for a wave propagated in 
positive x-direction; hence the negative sign is to be used here a 
We now introduce the following notation for convenience: 

■\ZKfui ) 11 — i . iimijua — um (1 — ix). ... 


( 22 ) 


r g? 
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Squaring and equating reals and imaginaries separately, we obtain 
equations for n and k : 

_ a 2 K 2 = Kft 1 
i&k = — ~~ ~ j * * * ' ' (23) 


where T is the period of vibration. Solving simultaneously we obtain 

n~J%(V&+**T k +K) ) 

{ }'.••• < 24 > 

n k- - J* (VWTTa^ - K) J 

Using the notation of (22) in (21), equation (19) then gives 

B„ — Ae ia,t e-*"* K ' c e~ an * ,e — Ae -**»*/« e M ‘- nx,c K . (25) 

The imaginary term —iruc is thus seen to furnish a real negative 
exponential function, i.e. a damping factor, while the velocity of 
propagation of the phase is determined by n. Thus the use of this 
same n for the index of refraction in insulating media is justified. 
The meaning of k is as follows. Since 

nai 27ry _ 2 t r 

~e ~ ~v~ A ’ 

the damping factor becomes e -2***/\ 

For sc = A, the factor becomes 


That is, in travelling a distance equal to the wave-length in a vacuum, 
the amplitude diminishes by a factor 


We now calculate the vector H, using equation (II) (p. S33), As 
in the case of insulating media, only a 2 -component of H remains, 
Using (25), this is given by 


BE, ^ __ o cE y 
dt t* ax 




<"* 


-f- in) Ae~' OH 




- n(I — be) Ae~ ‘ 


-»(!—«)#„. (27) 


Now from the relationships in the complex plana, 

i — %k = yl -j- k* s“ ’ r , and tan y = k. 


. ( 28 ) 
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Hence 

H, = - VT+k* . E v e~ iy = n Vr+ K 2 Ae~ u ” ,nxic ««*-»*/*>-* (29) 

f i p 

Thus we find that there is a difference in phase between the electric 
and magnetic vectors in conducting media: H lags behind E by the 
angle y. According to the discussion in this section, every conductor 
should absorb electromagnetic waves, while all insulators should 
transmit them. However, we can give at once examples which con- 
tradict this: A solution of Nad, in spite of high electrical conductivity, 
is perfectly transparent, while vulcanite, which is a good insulator, 
is opaque. The explanation is again, as in the case of the index of 
refraction, variation with frequency— here, variation of the con- 
ductivity. In calculations we may U3e the static value of the conduc- 
tivity down to a wave-length of about 1 cm. for all substances. A 
possible solution of the difficulty will be discussed in th6 second part 
of the Theory of Electricity {§ 2, p. 451). 

6. Hertz's Solution of the Field Equations. The Hertzian Oscillator. 

Up co now we have considered plane waves without inquiring into 
their origin. The waves actually come from spatially limited sources, 
whether they are wireless waves emitted from an antenna which is 
small compared with the distances spanned by the signals, or light 
waves emitted by tie radiating atoms or molecules of a gaseous source 
of light. Eor this reason, the waves are approximately plane only at 
grea.t distances from the source. Hertz found a solution of the 
held equations which provides a unified representation of source and 
radiation field in che whole of space. If we confine our attention to 
a vacuum (free space), the appropriate field equations are 

curl H = ^ ....... (I) 

in 19H , TT . 

(II » 

div E = 0, div H = 0 (Ill) 

In determining the magnetic fields accompanying steady currents, 
it proved expedient to express H as the curl of an auxiliary vector V, 
the vector potential, which bore a simple relation to the distribution 
of currents. We proceed here in a similar manner. Tbe formulas be- 
come appreciably simpler if we choose as vector potential the time 
derivative of an auxiliary vector 25 and supply also the factor 1/c. 
We thus put 


H= - 


3Z 

dt ’ 


(30) 
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Since the order of the differentiations can be reversed, 
write 


H = (enriZ). 


we may also 
. . (30') 


Substituting in (I), and integrating with respect to t, 

E = curl curl Z = grad div Z — AZ. . . . (31) 


In order that (II) may be satisfied we must have 

curl E — — i curl . . . , . (32) 

Integration yields E = — ^ + grad <f>, .... (33) 

The arbitrary vector grad<£ appears in place of a constant of integra- 
tion, since the equality of the curls of two vectors implies only that 
these vectors differ by an irrotaticnal vector grad<£. The two ex- 
pressions (31) and (33) for E will be identical if we take 

<f> = div Z (34) 

, 13 8 Z 

Az== c*af (35) 


Hence, as may easily be verified independently, tire values of E and 
H defined in terms of Z by (30), (33) and (34) satisfy (I), (H) and (III), 
provided Z is a solution of the wave equation (35). 

We no longer seek the solution corresponding to a plane wave, for 
this could give no singularity at the origin; rather, we attempt to 
find the solution corresponding to a spherical wave, in which the sur- 
faces of equal phase are spheres. For this purpose we write AZ in 
spherical polar co-ordinates, and assume that Z is a function of r only: 


9Z , 23Z_ 1 0*Z 
dr 2 ' r dr <? ' 

Assuming a simple periodic variation with time, we write 
Z = F{r)eh' ...... 


(36) 

(37) 


and obtain the equation 


cPF 2 <£F ej* 


F = 


0 . 


. ( 38 ) 


for the point function F. 


The integral is 
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as may be seen by substitution, or by writing the equation in the form 


The negative sign corresponds to a wave spreading outward from the 
centre r— 0. We then have 




■ (40) 


where p(t) replaces P^'- This spherical wave differs from a plane 
wave in that the amplitude decreases as 1/r, corresponding to the 
spreading of the energy over a sphere of increasing size. The field 
vectors E and H are obtained from Z by simple differentiations. Start- 
ing with H, the simpler of the two, we have 

curl Z = curl {pf(r)} = [grad/(r), p(t)] 

= <«) 
where r 0 is a unit vector along r. From (40), 

. . . 

dr \ cr r 2 / 

Hence H = ^ ~ curl Z = ~ + '—j e i ** ( ‘- rW [p o r a ] 


(42) 

(43) 


The vector H is thus seen to be composed of two parts, similarly directed 
but decreasing according to different powers of 1/r. We can locate 
the regions where one or the other term predominates by writing 

•<«> 

The scale of comparison is thus given by the wave-length. In the 
near zone (r <C A), the first term predominates; in the distant zone 
(r» A) — the actual radiation field — the second term controls the 
situation. Between the two is a region where both terms are of the 
same order of magnitude. We investigate the near zone first. Here 
we have 

H' - 7 ' ?&•-! '•] < 45 > 


If we compare this expression with the Biot-Savart formula for the 
magnetic field due to an element of a current-carrying conductor 
(p. 303), we see that, apart from the factor 
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the field in the local region is that due to a current element 

Ids ~ ~J~- = iwfiffi'" 1 , (46) 


The factor 


corresponds to the finite velocity of propagation of the magnetic field, 
which causes a phase displacement, increasing with distance from the 
centre, with respect to the instantaneous value of the current element 
(“ retarded field ”). In order to gain a conception of such an alter- 
nating current element, we consider an electrical dipole whose length 
and direction are given by 4 and whose ends carry periodically 
oscillating charges 

q « ........ (47) 


The moment of the dipole will also be a periodic function of the time 
^ = g/=? 0 ^4 ....... (47') 


dt dt 


• • (48) 


But dqldt is the current flowing as the charge changes. Our solution 
thus gives the magnetic field of a dipole whose axis is fixed and whose 
moment <p is periodic. Such a dipole is called an electrical oscillator or 
Hertzian vibrator. Later we shall find that the electrical field in the 
near zone also corresponds with the electrostatic field of a dipole of 
periodically oscillating moment. 

A dipole of this kind may be represented by a short linear antenna 
in which high-frequency oscillatory currents are induced, say by 
inductively coupling tire antenna to an oscillating circuit. In con- 
trast with steady currents, where the impossibility of isolating indi- 
vidual conductor elements made us reject the Biot-Savarfc law as a 
fundamental basis, current elements are experimentally attainable 
when we deal with oscillations. However, the magnetic field of such 
a segment is given by the Bioi-Savart rule only in the immediate 
neighbourhood of the conductor,; and then only to a first approxi- 
mation. According to the theory of action at a distance, we should 
use this rule at arbitrarily great distances from the conductor; this 
would yield results which are entirely contrary to experience, for at 
great distances (i.e. in the " radiation field *'), the second term of (44) 
controls the situation. By (43) this gives 

- 3? • j tA*-.) - + s [w <49 > 

This magnetic field which decreases as 1/r is no longer proportional to 
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the first derivative of p — which by equation (48) is proportional to 
the anterma current — but to its second derivative, i.e. to the rate of 
change of the current. 

In order to calculate E it is best to start with equation (31) (p. 336): 

E=graddivZ — AZ. ..... (31) 
We also use the fact that Z satisfies the wave equation, i.e. 

E-graddivZ-^ ~ (3V) 

Now by equation (40) (p. 337), 

div Z — e*"-' div ^p 0 - ^ = e*“‘ di v { p 9 /(r)} — e*“‘ p o grad / (r) 

whence 

i ( 51 ) 

where p a r in the first term has been replaced by /> 0 r 0 r in order to 
make clear which powers of r occur. We also have the expression for 
dfjdr given by equation (42) (p. 337): 



dr 

(-$-})* • 

. * • (52) 

and, farther. 

*/__/ 

to 2 2ta> . 2\ 

- — -t 4- — ) 

. - • (S3) 


dr* \ 

c*r or* ‘ r 3 / 

We also have 

1 0 2 Z 
c 3 dt* 

— ■ 

. . . (54) 


The field in the near zone is governed by the highest power of 1 It 
only. Since r 0 is a unit vector, this is the third. But this power does 
not occur in the term 

1 3 2 Z 
? dt*’ 

so that the electr ic field in the near region is derivable from a scalar 
potential 

Vy — — divZ . (65) 

Neglecting the lower powers of 1/r, this quantity becomes 

7* = ^ . . . (56) 

But according to equation (28) (p. 273), this is precisely the potential 
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of a dipole of moment fi, taking into consideration the variation of 
phase with increasing distance caused by the finite velocity of pro- 
pagation. This is then the so-called “ retarded potential ”. 

To determine E in the radiation field we must collect the terms m 
1/r. Since r 0 ®= 1, this gives 

*•= &**-"* (57) 

Taking the axis of the dipole to be the 
polar axis, H has the direction of the 
latitude circles [cf. equation (49)], and E 
is along the meridians [equation (57)] in 
the far region. E, H and the unit vector 
r 0 in the direction of propagation again 
form a right-handed system (fig. 2). In 
contrast with the phase, the amplitudes 
of E and H are not constant over the 
surface of the sphere, but decrease toward 
the pole (0=0) as sin0, on account of 
tbe factor [^ 0 P* 0 ]- This means that a 
Umar oscillator emits no radiation in the 



direction of its axis. This further implies that on account of the vanish- 
ing of the radiation term near 0=0, the contribution of the term for the 
near zone may not be neglected entirely, in spite of its decrease with 1 /r 3 , 
‘since it does not vanish for 0 = 0. We are therefore not free to draw 
the meridional direction of E quite to the pole. The contribution of the 
first term causes the lines of force to become rounded at the poles, giving 
Hertz’s celebrated instantaneous picture of the lines of force (fig. 3).* 

* Consult H. Herts, An n. Phyt. Chem., 86. p. 1 ( 1880 ), or bis Oesummelte Wtrht, 
U, p.146. 
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If we now consider the energy flux vector S, which is in the direc- 
tion of r 0 , its instantaneous value is given by c/4-rr times the product 
of the magnitudes of the mutually perpendicular vectors E a and H„. 


Passing to the real functions, these are 

I H * I = . . . (58) 

Hence we have 

i s i = £ • S ^ p ° 2 sine0cosico {(*-')}• * * (69) 

Since cos 2 ^ = 1/2, the time average is, 

1 * 1 - 8 ; «*» 


In wireless telegraphy the energy radiated in various directions is 
conveniently represented by a polar diagram in which the length 
of the radius vector p is taken equal to the radiated energy, on some 
suitable scale. The equation of such a curve for the linear oscillator 
is, from the above 

p — p 0 sin a 0 (61) 

The ayerage energy which passes through a sphere of radius r each 
second is obtained by taking the surface integral 

<j) 8dA—-^ £p.p 0 *.2un*f sin 2 6 sin 0 d6>= ~ ^ 2 tt . f p 0 2 . (62) 

This gives the energy lost by the source of radiation each second: 

(63) 

This expression can bp given another form by inserting the current 
amplitude Z 0 in place of the amplitude of the dipole moment. Accord- 
ing to equations (46) and (48) (p. 333), the magnitude of Z 0 is cupjds. 
If we put the length ds of the element (the antenna) equal to l. 



This loss of energy by radiation is proportional to the square of the 
current, like the ordinary Joule loss. With an ohmic resistance R, 
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the same alternating current would have an average energy loss in 
the form o£ heat amounting to 



The ohmic resistance which would cause the same loss of energy as 
the radiation is called the “ radiation resistance M R„ of the system. 
According to the above 

R b — ^ ( 1 ^ electrostatic units, 

or SOrr 2 ohms (66) 

It is seen that even for antenna lengths of but 1/10 the wave-length, 
the radiation resistance is of the order of 10 ohms, i.e. large compared 
with the ohmic resistance. 

Ex. 94. Show that the phase velocity of H is greater than a in the transi- 
tional region between the near and distant zones. 



CHAPTER XVIII 


Electromagnetic Waves: II. Phenomena in Two 
Adjoining Media 

1. Unified Rigorous Derivation of the Field Equations and of the Bonn* 
dary Conditions. 

In setting up the electromagnetic field equations we started with 
common facts of experience. In proceeding thus, we were unable to 
give a rigorous proof of the electrical analogue of the law of magnetic 
induction, equation (1) (p. 327); moreover, the boundary conditions 
were derived only for static or stationary fields. Hence before pro- 
ceeding further, we shall explain a more satisfactory way of arriving 
at these results. The proposed method is based upon five hypotheses 
which, it must be admitted, are pot as immediately obtainable from 
experiment as are those which formed the basis of our previous method. 
However, the following procedure has the advantage of generality and 
compactness. The five assumptions are the following: 

X. The expression for the density of electrical energy is 

“““I 61 

2. The expression for the magnetic energy density is 



3. The energy flux is given by 


» = £[EH]. 


< 3 ) 


A The expression for the electrical energy converted into Joule 
heat per second per cubic centimetre is 

M 


5. The experimental fact of the linear superposition of electro- 
magnetic fields, which requires that the field equations be linear and 
homogeneous in E and H.* 

* Wa omit from consideration here the special case of ferro-magnetio media, where 
non-linearity of the equations results from the fact that p itself depends upon the 
value of the field. 

848 
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These five postulates emerge in a plausible way from general con- 
siderations if we seek the simplest and most direct dimensionally cor- 
rect relations between the quantities involved. This is essentially what 
was done on p. 288 in obtaining an expression for the energy density, 
The field equations for a uniform medium follow at once from the 
following energy considerations: the decrease per second of the elec- 
tromagnetic energy within any volume is equal so the sum of the 
energy converted into heat and the energy which flows out through the 
bounding surface, i,e. 

H^3r 

dtJ \§n P 8* t) J\ 4 v dl 4t r dt j 

*- J aE 4 afr-f [EHJrfS. . (5) 


If the surface integral is converted into a volume integral by means 
of Gauss's theorem, we -have, since div [EH] = H ourlE—- E curlH. 

[ ( KE~ + pH — -f ina& 4- jH curlE— cE curlH \ dr= 0. (6) 

J \ dt at f 


Since the integral is zero for any arbitrary volume,, the integrand must 
vanish. 'This is assured, and at the same time Assumption 8 is satis- 
fied, by setting equal to zero the factors by which B and H are scalariy 
multiplied. From this it follows that 


curl H — 


K d E 
c dt + 


o-E ~ 


1 ?D , 4 w 

o dt + 7 


il) 


curl E 


p d 'SL 
C it 


m 


Idle boundary conditions are obtained from the obvious condition 
that ax- she surface of separation of two media the energy leaving the 
first medium must enter the adjoining one, since for finite conduc- 
tivity no electrical energy can be dissipated as heat in the roaihe- 
'■natical surface of separation. If we take the normal to the interface 
ts B-aads, 


g,<» - ± [Bt»B«], 0 S«> «. [B(» . {?) 

4 n 4ir 

SJ'WJV - 8 m ' l WJ'> = — RWE*>. ... ( 8 ) 


This condition is secured by taking for the equations to be satisfied 
at- the bounding surface * 

•The uutia.ee conditions (IV). are uRually deduced directly from the fundamental 
equation. (1) aad (U) above. Wo take a email rectangular area having two 
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TWO ADJOINING MEDIA 
Bjto = 3™) 

BJV = BJ» . a.w— zr,®f * 

and *//«»; 

or using the concept of surface curl, 

Curl E = [« (Ii. — E,) ] = 0, OorlH = 0. . (IV') 

The relationships governing the normal components are already Used 
by this condition of continuity of the tangential components, since 
E and H must, in addition, satisfy the field equations in each of the 
two media. The general validity of the continuity of the normal com- 
ponents of D — ITS and of B — pS. is readily demonstrated by a 
simple computation naftig (IV) and the field equations. This also 
implies, of course, that the normal components of 1 and H them- 
selves suffer discontinuities at the interface. 

2.* Waver in the Boundary Layer, The Ground Wave. 

The Hertzian solution, which includes the antenna as well as the 
radiation field, fails to represent actual conditions ui this respect, that 
the transmitter is not suspended in space, but is located at the boun- 
dary of two media having quite different electrical properties. We 
must therefore find two solutions which satisfy the boundary con- 
ditions as well as the held equations — one for the upper medium (the 
air), the other for the earth.' The problem, replete with difficulties, 
was solved in a series of papers by Sommerfeld We restrict cor atten- 
tion to the simplest case- -the demonstration of the possibility of a 
kind of piano wave having its greatest amplitude at the bounding 
surface, ana hence referred to as & surface wave or boundary layer 
wa re. The consideration of this wave apart from the total phenomenon, 
it is true, is somewhat arbitrary. 

We take the surface of separation to be Che ay-plane of a rect- 
angular system of co-ordinatos, the direction ox propagation being 
along the positive a-axis. For a plane wave in a uniform medium, E 
as well as K would be normal to the direction of propagation. But 
the boundary conditions cannot be fulfilled if we make such an assump- 
tion for two media. We therefore generalize the hypothesis m two 
ways: first we assume the waves to be non-uniform, i.e. the amplitude 
is to bo a function of z, the distance from the interface. Secondly, we 

of its sides normal to the surface, and the other two (which are made much longer) on 
opposite sides of the surface and parallel io it. On the assumption that- the.-e are no 
surface currents,, electric or magnetic, it follows at cnee from I and TI respectively 
that the taugentia! component of the magnetic vector and that of the oiectno veotor 
arc continuous across the surface. There is one importrnt esse in which surface 
electric currents are present, viz. the case of a perfect conductor (that is, speaking 
optically, a perfect reflector); in this case the sole condition is that the tangential 
conaponsnt of ohd &Ieotyic vector vB-oiahsJs, 
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• • (IV) 
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assume that H, which we take in the ^-direction, is representable by 
a purely transverse wave. It will be seen later that E, on the contrary, 
has an ^-component in addition. Thus for H opr hypothesis is 

H = H v j, H t = .... (10) 

We leave the phase velocity of the wave still undetermined; it is to 
be expected that this will be determined by the electrical constants 
of the two media. With the above assumptions (10) concerning H, 
the field equations give for the components of E: 

-f S #+ 1 F®- • • • <u> 

o-f W + *T S " • • • < 12 > 

_ E -f. tz? E v ... (13) 

V ' 0 ot c 


We sec from (12) that E v may be set equal to zero; on the other hand, 
the ^-component of E does not vanish. 

The Maxwell equations (II, p. 344) also indicate that E is indepen- 
dent of the y-co-ordinate. The second of them yields the equation 


BE X dE, _ ym> - 
dz dx o J 


(14) 


We now assume non-uniform waves, propagated in the x-c jction, 


for E u and, E t also: 

= (15) 

E c = (16) 


Substituting this value of E x in equation (11), we have 

-4 

< 17) 

icoc^ 

From (13) and (16), *- 5 ^ 3 - (18) 

And finally from (11), (13) and (14), 

■ ■ <> 9 > 

/= Ae», 


The solution is 


( 20 ) 
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where , - ± - % ) + ^ i. . . (W) 

This solution has the same form for both media. But since solutions 
increasing exponentially with z have no meaning, we must choose the 
sign before the radical in such way that for z > 0 (i.e. in air) the 
real part is negative, while for z < 0 (i.e. in the earth) the sign must 
be so chosen that the real part is positive. Denoting all quantities 
pertaining to the upper medium (air) by the subscript 0, the solution 


for this space becomes 

fir v (o) = .... (21) 

and in the ground: H v — Ae +I ‘*e? al, ~ xlv ' 1 (22) 

Also, by (20'), 


o» 2 _ a _ goMo . .2 

V Po ~ c 2 c* 

The boundary conditions expressing continuity of the tangential 
components of E and of H, which must hold at every instant, can be 
fulfilled only if the phase velocity is the same on both sides of the 
interface, he. 

( 24 ) 

Further, for z — 0, = H v yields the relationship 

A = A 0 . (25) 

The tangential component of E is obtained from equation (15), using 
(17) and (20) and attending to the remark made above as to the sign 
of p: 

E. (0) = + 

The boundary condition thus yields the equation 


47tct 0 + 47 ra -f- Kan' 

This equation, taken along with (23) and (24), suffices to determine 
the remaining three unknowns p 0 , p and v. If we simplify the right side 
of equation (23) by introducing the notation 

, . ( 28 ) 


cpgAe-^t E _ cpAe +p *& a ( t ~* M 

imJ 0 + K 0 u)i ’ * 47ra + Kcoi 
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and if we call 

hires' + Eari 

4va 0 -f- K 0 iui 

<1, ‘ . • • • 

. m 

ws have 

! 

! 

i 

. m 

From this, 

Wnv! 

bo 

y 2 ’ 

. ( 31 ) 

and then from equation (27) 




P^—Pei- 


. (32) 

Finally, from (30) and (31), 




L+li-zA. 

* V l-« 2 

• (33) 


It is to be remembered that all three quantities a 3 , p and v are com- 
plex. By separating the real and imaginary parte of the exponential 
function it may be showy that the solution has the form 

3j» = . . . (34) 


From this result we see that 

1. The amplitude of the waves decreases with the distance e from 
the surface of separation. 

2. The waves suffer a spatial damping in the »-dir§efcien also. 

3. The planes of equal phase are not those for which a — sonata 
but are the oblique planes xju -j- s^sjta ~ const. 

By substituting the solution in equations (15) and (16) ws find 
further that the x- and s-ooinponents of 12 show a difference in phase, 
so that the end point of the vector £ moves on an ellipse, not along 
a straight line. The plane of tills ellipse, however, is not normal- to tbe 
direction of propagation, as in t ha case of eiiiptieaUy polarised free 
waves, but contains that direction. The above analysis does not 
correspond entirely to the general physical problem, inasmuch as we 
have dealt with piano waves only, without, taking the scarce into 
consideration. The natural extension of the theory consists in seeking 
a solution corresponding to the Her bz solution for both media, taking 
the boundary conditions into account, Sommerfeld’s papers deal with 
this problem, but the details cannot- be given here.* 


* An abridged treatment i» given in Prank-Mises, Die Differential und Integral- 
gleichungen der Mcchanik und Physik, II. Tell, p. 641, Braunschweig, Vieweg & Sohn 
(1027). Consult also W. H. Eceles, Wireless Telegraphy and Telephony, p, 169, Van 
Nostrand Co., Now York (19X6). 
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3. Consequences of the Boundary Conditions for Insulating Media. The 

Optical Laws of Reflection and Refraction. 

The considerations of § 2 with regard to boundary-layer waves 
are not applicable to the short electromagnetic waves of the optical 
range, for the reason that each radiating atom is so far from boundary 
surfaces (compared with the wave-length) that it may be considered a 
free radiator in a uniform medium. However, if a' wave sent out from a 
radiating atom encounters tbe surface of separation of two different 
media, part will be reflected and part will be transmitted. We assume 
the source to be so distant that the wave may be considered plane. 
We also deal, at first, with wave surfaces of unlimited extent. 

It is assumed that the two media show no absorption, so that they 
are insulators, and the interface is taken to be the xy-plana of a rect- 
angular system of co-ordinates. Let n, the direction of the incident 
ray, be in the oz-plane. For the present we do not wish to make any 
special assumptions regarding the directions n' of the reflected ray 
or n" of the refracted ray. Assume that the planes determined by 
each of these directions and the z-axis mak3 angles <p' and <j>" respec- 
tively with the *z-plane. L>enoting the angles of incidence, reflection 
and refraction by 6, & and d" respectively (eee also fig. 1, p. 351), 
we have 

n — zsin0-f- koos8 1 

ri ~ i sin cos <f> -f- j sin & sin <f> — k cos O' i . (35} 

n" — i sin 5" Gos<f>” +j sinfi* sin <f>* -f & cos#' J 
The electric vectors of the three waves are then 

E' — a 

~ A'gS'o'i « — (x »in S' cob 4' -}-y ulo 6' sin $"- ■ * coa f)n, je ( + W $6) 

E" ~ A 

co»<li r +y^nC' t)n<y-hn<sm9 , 


while the magnetic vectors are given by 

H = H' — a /— [ri E' j, H" = J~ 2 [<E , a (37) 

At the interface we must hav.% for any values of x and y, at any 
instant 


E x + EJ = B s " \ 
E v + b; = E“ J 


(38) 


and 


E x + H x = E” \ 


(39) 
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This is possible only if the exponents of e agree in all three terms. 
This implies 

a>' — at" = to, (40) 

i.e. there is no change of frequency for reflection and refraction at a 
stationary surface of separation of two media. A second consequence is 
S' = S" = 0 or n, (41) 

which means that if any phase change occurs at the interface, it must 
be i r. Further, comparison of the exponents yields 
f = 4 >" — 0, 

i.e. both the reflected and the refracted rays lie in the plane of inci- 
dence — the plane determined by the incident ray and the normal to 
the surface; this is the aa-plane in our case. Finally, we obtain 

sin0=sin5' (42) 

the law of reflection: The angle of incidence is equal to the angle of re- 
flection'. also 

*4 sin 0 = n 2 mid" (43) 



the law of refraction (Snell’s Law): The sine of the angle of incidence 
is equal to the 'product of the index of refraction and the sme of the angle 
of refraction. (See also p. 371.) 


4. Polarization and Intensity Relationships for Reflection and Refraction. 

Preanel’s Formulse. 

The laws derived in the last section are also obtainable from almost 
any wave theory of light — in fact, from very elementary considerations. 
A more severe test of the electromagnetic theory is furnished by the 
question of the intensities of the various waves. While elementary 
wave considerations are of no help here, the electromagnetic theory 
gives the required answer directly from the boundary conditions. The 
equality of the exponents, which follows from the continuity of the 
tangential components and which is expressed by the laws of re- 
flection and refraction, yields the relationships (when we assume that 


A a + Af = A". ...... (44) 

d„+A v ' = A r ". (46) 

[wAL + [«'A']. = «[«"A"] (46) 

[hA], + [m'A']„ = n[n”A. , '] y (47) 
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In component form* 

[»A] e =— A^cobO; [«'A']„=4 v 'cos 8; [nW\,——A,'' coalT. (48) 
(«A] V = — A a sin0 -f- A„ cos0; [ri A']', — — A„' ehi6 — A w ' cos 0; 

[»"A"], ==> — A" sin 0" -f- A a " cos 0". . . (49) 

This resolution, however, is not the one appropriate to the physical 
problem. The natural resolution is 
that into one component in the 
plane of incidence and another nor- 
mal to this plane. In what follows 
we assume the light to be linearly 
polarized, i.e. the electric vector 
remains in a given plane, called 
the plane of vibration of the light. 
On the other hand, we shall avoid 
the use of the phrase “ plane of 
polarization ”, for this term is 
usually employed for the plane of 
the magnetic vector, and this is less 
important for the effects produced 
(e.g. the photo-electric effect) than 
is the plane of E. The component 
of the vibration normal to the 
plane of incidence is denoted by the 
subscript s and is given the direc- 
tion of the positive y-aads for all three rays: 

A'=A V , A; = A ¥ \ A," = A”. . . . (60) 

The positive direction of the components in the plane of incidence, 
which are characterized by the subscript p, is so chosen that it is ob- 
tained by rotating the direction of the ray through -f-90° in the plane 


of inoidenoe. We then have (cf. fig. 1) 

A x — A b oob0 I 

A/ =-A p 'cos0 l. ..... (61) 
A," = A P " cosfl' j 
A t — ~A P ain0 1 

AJ — — Af sin0 l (62) 

A" = — Af sin 6" J 

Substitution in equations (44) to (49) gives 

(Ap - A,') cos 6 = A, " cos 0" (63) 

A t + A; =A t " (64) 
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(A,— A,')coa8~- nA," cq&B". .... (55) 
A p + A p ■■= nA P ". ...... ( 56 ) 

These are four equations for the four unknowns A pj A/, A p " and A”, 
Their solution is very simple. We have at once from (53) and (56) 

A ’ — A wcog 0~ cos fl" a sin2g — sin2g '' . 

* s n cos 0 + cos (9" 9 sin 20 + sin 20"’ 


or A'- A teu (0-0") 

Ap ~~ A * Un(d +1*)' 

Likewise, from equations (54) and (55), 

A , _ . si it (6 — 8") 


’ "sin (8 -p 0")' 

Further, by substituting these values in (53) and (55) we have 
2 sin 6" cos 9 


A.” = A P 


A," = A s 


(e +1") oos if? — e") 
2 sin 9 " cos 9 


iin(0+ 0")' 


(5V) 

(58) 

(69) 

(60) 


The equations (57) to (60) giving the relationships between the ampli- 
tudes of the rays are known as Fresnel’s formulae, after their discoverer. 

The negative sign in equation (58) has the following meaning: 
If 9 > 6", i.e. if the ray encounters an “ optically denser ” medium 
(n > 1), A, changes its direction upon being reflected. On the other 
hand, for ( 6 + ®") < w/2, A p has the same sign as A„, but the positive 
direction is now reversed (p. 351 ), so that both components change sign on 
reflection from a denser medium, which means that the electric vector has 
a phase change of tt. This jump in phase is of importance in several in- 
terference effects that depend on difference of optical path, and must bo 
taken into account by the addition (or subtraction) of half a wave-length. 

The ratio of the amplitude of the normal component to that of the 
component parallel to the incident plane gives the tangent of the 
azimuth u of the plane of vibration. Division of equation (58) by 
equation (57') gives 


| tanu' j — tan« 


cos(0 — 6") 
cos Tf+H> 


(61) 


and similarly, from (60) and (59), 

| tan w" | — tan« cos(0 — 9") (62) 

From these results certain others follow. 
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In the first place, (61) shows that for 9 -j- 9" ~ w/2 we always 
have 

| tan u' j — oo , «' = |. .... (6F) 


Thus, if the reflected and refracted rays are perpendicular to each 
other, the plane of vibration of the former will be normal to the inci- 
dent plane; as we may say (see 57'), the component in the plane of 
incidence will not be reflected at all. If, then, we have ordinary light 
— i.e. light whose planes of vibration have any random orientations — 
reflected under these conditions, we obtain plane polarized light whose 
plane of vibration is normal to the plane of incidence. In the older 
literature the incident plane in these circmnsfcances is called the plane 
of polarization, and is the plane of the magnetic vector. 

If we combine Snell’s Law (p. 350) with the condition #„-f- QJ' -- w/2, 
we obtain the following relationship for the polarizing angle 0„, i.e, the 
angle of incidence for which natural light is converted into plane 
polarized light; 

tan#* — n. ....... (63) 

This is Brawstor’s Law. For the reflection of ordinary light at my 
other angle we obtain partially polarized light, for then the com- 
ponent in the incident plane is also reflected, although to a different 
degree from the normal component. Since 6 and 3" cannot be greater 
than, rr/2, we have for the reflected light 

| tana' | > t-anw, ...... (64) 

i.e. the plane of vibration of plane polarized light is tamed /arena from 
the plane of incidence upon reflection. 

According to (62), we see that the plane of vibration ox the re- 
fracted ray is turned toward the incident plane, but never comes quite 
into coincidence with it. Nevertheless, since the direction of this 
tinning does not change when the ray emerges from & denser medium 
— oos (f? -- 6”) having the same sign, whichever angle is the greater— 
wo car. polarize ordinary light to a high degree by sending it through 
a number of plane parallel plates separated by 'air films. This is the 
so-called “ pile of plates ” for polarizing light by refraction. 

For normal incidence Fresnel’s formulae give indeterminate values, 
but the solution of the system of equations (53) to (56) (p. 351) is 
particularly simple in this case. The results are 



(B 7U) 
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For perpendioular incidence there can. be no difference between | A p | 
and [ A, |, since the plane of incidence is entirely undetermined in this 
instance. The reflecting power R is defined as the ratio of reflected to 
incident intensity for normal incidence: 

p r A' ■ (n - 1)» 

R== T~~A*~ (n+Tp‘ ‘ * * * (66) 

Ex. 95. The degree of polarization is defined as the ratio 


Ex. 98. Show that a balance of energy flow is maintained for reflection and 
refraction by an insulator. 

5. Total Reflection. 

The law of refraction (Snell’s Law), 


leads, when 0 exceeds a certain value, to values of sin 0 " which are 
greater than unity — he. to imaginary values of d " — for light passing 
from an optically denser medium to one which is less dense (» < 1). 
The elementary theory contents itself with this observation and con- 
cludes from it that no light whatsoever passes over into the less dense 
medium — in other words, the reflection is total. But since we have 
already used complex functions in our analysis, we need not be dis- 
turbed by complex values of 0"; as a matter of fact, we obtain very 
important results by extending the Fresnel formulae to include this 
case. In these formulae (p. 362) we thus put, if sin 0 > n. 


8, cos0" — - Vsin s 0 — n a . . (67) 


cos 8 — \ Vsin a 6 — n 2 
m a 

v • . — 

cos 8 -f- ■^y'sin a 0 — »® 


The complex quantity by which A p is multiplied is of the form 


* sin 0 cos 9 4- sin 9" cos 0" 


where 


(70) 
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As appears from (36) (p. 349), 

EJ = (71) 


the component of E in the incident plane undergoes a change of phase 
of —2^ for total reflection. Similarly we fqid 


, , A cos 6 — *' Vsin*0 — n 2 _ u . 

A, MS A, . yaagss™ . = A,e M *, 

coso 4- * Vsm 2 6 — n* 


. (72) 


where 




Vain*fl— n* 
coafi 


(73) 


Thus the phase changes are not the same for both components; the 
components are displaced relative to one another. But this means that 
the originally linearly polarised light, in which the electric vector 
remains in a given line in a plane normal to the ray, becomes ellipti- 
cally polarized, i.e. the end point of E describes an ellipse. The phase 
difference 8 of the components is given, according to the above, by 
8 =2(^-0) (74) 


In order to determine 8 we consider the equation 

V sin a 0— l) 

W (76) 


tan-=tan(^—0)= 

2 l + taa<j>taatp 


•{‘+3 


Je- 




After a simple reduction this becomes 


8 cos 0 a/ sin 2 6 — n % 

— sn — • 


(76) 


Since the complex numbers by which A p and A, are multiplied have 
the modulus 1, the amplitude is not di min ished, and the term “ total 
rejection ” is justified. 

The results obtained by applying the Fresnel formulae to the. second 
medium where — according to the elementary theory — nothing should 
occur, seem quite remarkable at first sight. Sinoe the angle 6 " be- 
comes complex for values beyond ff/2, we set this angle equal to 
w/2 + t/3 and use the formulae 

C0Btj3 = coshjS; sint/3 = isinhjS; cosh*/? — sinh 2 /? = 1, (77) 

i.e. sind" = coshjS = 


- - Vain 2 6 — »*. . 


. (78) 


cos0" = — «sinhj3 
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A — x ccahfi . is ainhff V 
- A ~ C In, tin, ) 


■ m 

.This represents a non-iiomogeneous wave whose amplitude decreases 
with extreme rapidity as it penetrates into the second medium. A 
numerical example for glass and air {n = 2/3) shows th&t for 6 — 60° 
the amplitude decreases to 1/180 in travelling a distance equal to the 
wave-length. -These waves are propagated along the s-axis with the 
phase velocity e/(» 2 cosh /5). It. becomes easier' to understand that there 
is a wave running along the interface in the second medium if we re - 
member that a similar condition arises in the first medium, when we 
have an unlimited beam. The following transformation (of. p. tS7) shows 
that the superposition of incident and reflected waves results in a con- 
dition which may be looked upon as a harmonically modulated v/ave 
front travelling along the boundary: 


E„+E,' = 




i A P e 


r(‘"~ 




= 2 A 9 cos f 




T. 


(80) 


may be represented by a diagram of 
dmthe 

Modviiiid Wave Frvd 




> 


'esviar Medium. 


Amplitude 
Barer Medium. 


The state of affairs in both xm 
the type shown in fig. 2, in 
phenomenon in the second medium is 
represented as a continuation of the wave 
in the first medium. With regard to 
experimental attempts to verify the exis- 
tence of the !! layer of light >s in the 
second medium, the objection might be 
raised that, the wave would be disturbed 
by any such experiment, and hence the 
reflection in the first medium -would no 
longer be total. Nevertheless, it would 
be justifiable to extrapolate from a very 
small disturbance in the second medium to no disturbance whatsoever. 
As long as no disturbance is present there will, on the average, be no 
transfer of energy into the second medium; for if the Poynting vector 
g is calculated it is found that at any instant there .are places where 
energy enters alternating with places where energy leaves, and that 
these locations themselves change periodically. 

Bx. 97. What index of refraction with xespoct to air roost. a substance lmv<. in 
order that circularly polarised light bo obtained by one total redeoUoa! At what 
angle must light be incident on the surface of separation of glass and sir (a «* 2/3) 
in order to obtain oirealaiiy polarised light by two total reflections! 
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8. Absorbing Media, The Optics of Metals. 


la considering the propagation of plane waver-; in conducting 
media (Chap. XVII, § 4, p. 333) it proved necessary to Introduce a 
complex index of refraction, the quantities n and k being functions 
of K and a. For optical frequencies, however, we ms not permitted 
to use the statical values, since these quantities vary considerably 
with frequency for short waves. For this reason we characterize a 
conductor by n and x instead of by K and a in the optical region. 
If desired, these quantities might be calculated from r. and k, using 
formula (23) (p. 334), but this would yield nothing essentially new 
for the optical, region. The connexion with the electrical values does 
become effective, however, at the long infra-red frequencies. 

We must thus substitute 


said 

n{ 1 — %k)' 


.( 81 ) 


tcsd' r = 


J j~^V r n*( 1 -t«c)»-3n*#. . . (82) 


in the Fresnel formula*. The ratio AjjAf is again complex, be. there 
is again a phase difference between the components in and perpen- 
dicular to the plane of incidence, as in the case of total reflection. The 
reflected light is sliiptically polarized. 

We thus write 


AJ cos (9 — 8^) A s ^ 

A.j ~ A„ cot {6 -f- 8") J~ iJt ' 


. (83) 


If wo remove the difference in phase by inserting a compensator (a 
suitably cut crystal plate) which produces the opposite phase dif- 
ference, then the azimuth of the restored plane of vibration becomes, 
according to (83), 

i • • * * ^ 

By vT.it) ng the complex number 

cos (6 8") 

co S (T+n 


in the form pe il: we can find the phase difference 8 as well as the real 
quantity p of (83) for any angle of incidence; this determines the 
ellipse of vibration. In general, this ellipse will be placed in some 
arbitrary position in the plane normal to #«'. There is, however, a 
particular angle— -called the principal angle of incidence 8* for which 
the ellipse is so placed that one axis fails in the plane of incidence. In 
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this case the phase difference between A,' and A,' is exactly w/2. The 
azimuth of the plane of vibration restored by inserting a compensator 
which nullifies the phase difference w/2 (a so-called quarter-wave 
plate, usually made of mica) is called the principal azimuth «'*, the 
azimuth of the incident light being assumed to be that of the plane 
at 45°. Hence 

a; 

“27 = 


_ . , 1 -f tang* tang*" 

1 — tan #* tan #*"' 


( 86 ) 


It follows f 


a this that 

tan 8* tan 8*" =» 


(86) 


where tan* = - = cotu'* 

(cf. equation (84), p. 367), and hence 

*- 5 -*% < 87) 

The auxiliary angle x introduced here as an aid to calculation is 
thus identical with the complement of the principal azimuth u'*. 
If we substitute the values of sin 8*" and cos #*" from equations (81) 
and (82) in the expression tan#* tan#*"’, we have 


tan#* si 


Vn a ( 1 — «)* — sin 2 #* 


= e a< x. 


Separation of real and imaginary parts gives two equations for the 
unknowns n and k in terms of the observable quantities #* and uf*. 
The aotual values of n and k are such that j (« — iic) 2 | >■ sin 2 #*. 
Hence, if we neglect sin 2 #* under the radical, we obtain very simple 
approximate formulas which are sufficiently accurate for most pur- 
poses. Thus 


tan#* sin#* 

-a-*.) 


(89) 


Comparing moduli and amplitudes of the complex numbers, we obtain 


It follows that 


tan#* sin#* — nV 1 + /c 2 ] 
k — tan 2x f 


n = tan#* sin#* cos2x 
k as tan2x 


(90) 

(91) 
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Prom equation (66) (p. 363) the reflecting power of metals is seen 
to be determined by 


4L 

A 


n — 1 — im ae ia 
n + 1 — inK ~ 


|e« «-». , . (92) 


The intensity depends only upon the ratio of the squares of the ampli- 
tudes. The reflecting power is thus given by 


R = 


(t)‘ 


( n _ 1 )« + W V = »* + BV + l-2n 
(n ■+■ l) a + tiV n* -f + 1 + 2n' ' ( 


The greater the value of k, the nearer the value of this fraotion ap- 
proaches to unity. Hence all wave-lengths which are strongly absorbed 
are strongly reflected. Hence the colours of such substances by trans- 
mitted and by reflected light are complementary, and a thin film of 
gold, for example, appears blue by transmitted light. 

H a plane wave is incident obliquely on the surface separating a metal from 
a vacuum, the refracted wave is inhomogeneous, i.e. the planes of equal amplitude 
are no longer identical with the planes of equal phase. This is the case because 
the various parts of the wave front (which is a plane of constant phase) have 
traversed diSerent thicknesses of the absorbing material. Reviewing the derivation 
of the law of refraction (p. 349), one sees that this law must hold even when the 
index is complex; indeed, the Fresnel formulae, which involve the refractive index, 
were applied to metals. On the other hand, if we examine the sine relation with- 
out troubling about absorption, a lengthy calculation shows that the ratio of 
sines depends on the angle between the planes of equal phase and the planes of 
equal amplitude, and so must depend on the angle of incidence. Hence for the 
refraction itself the familiar law no longer holds. Because of the strong absorp- 
tion, however, the refracted , waves axe only slightly inhomogeneous, so that tide 
remarkable phenomenon is very inconsequential in practice. 



CHAPTER XIX 

EMCTBOatiGjrSTIO WAVES. TIT, PaOFAGATION 1ST AWtSOTSJOPIC 

Media, The Optics of Cxvstals 

1. i'ho Field Equations for Anisotropic Bodies, 

For anisotropic media--’ we have in view chiefly crystals which do 
not belong to the regular system — the relationships are more complex 
in that the polarizability of the atoms cannot be expressed by a simple 
scalar, as in the case of a free atom, for which’ this quantity averaged 
over all orientations is the same in all directions. Since it is part of 
a crystal lattice, the atom is no longer perfectly free to turn and hence 
it is no longer possible to average over all orientations. Also, in addi- 
tion to that' due to the character of the atom itself, a further element 
of anisotropy may be introduced by the presence of neighbouring atoms. 
If the polarization is different for the various components of B, the 
vector P no longer has the direction of E. One thing, however, 
must 'be preserved: the linear connexion between P and B. - & thus 
becomes a linear vector function of E, and hence the susceptibility 
is a tensor: 

P ■= WE, 

But the vector of the dielectric displacement D also Becomes a linear 
Emotion of B, and the dielectric constant is now a tensor: 

P = E A- 4-jtP «r= 1 -f AwTE = m. ... (2) 

In component four., 

D, « K n E x 4- K lt B, + X IS E„ : 

Di^KnB.+ KaE' + KuJB, k ... (3) 
IJ, = K. Al E,, -f- E^E, + K o3 E, J 

Since we deal principally with transparent crystals, ferromagnetic 
bodies, which at the same time possess metallic conductivity, do not 
corns into consideration, and we can set the permeability equal to 
unity. With this stipulation, the second field equation remains un- 
changed. How must the first field equation be written for the present 
case? Sines every change of polarisation means a motion of clec&rio 

380 
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charges, or a current, of density dP /dt, we have in genera! for all media 
_ ! 9E 4ir 9P = 1 SD 

cdt c dt e di' 


cuxl H — - -r~ -f 


(4) 


If we consider the derivation of the field equations in Chap. XVIII, 
§ 1 (p. 343), which was based upon perfectly general principles, we see 
that in order to obtain the form (4) we must make the following as- 
sumption for the energy density in the general case, i.e. for anisotropic 
media: 

" - 1 ...... ( 5 ) 


8n 


The energy flux vector, which contains only E and 3 
bo given by 


, continues to 


The conservation of energy is thus expressed by 

f£i Bn*--* £/(*»+«•>*. • 


(6) 


~ f (H curl E — B curl H) dr^-~ ~ J(ED + £C*)<jr 

• ■ « 

We again obtain the field equations as on p. 344 by equating separately 
the factors by which E and H are multiplied. For the second equation 
this offers no difficulty, but the first equation in the form (4) reunite 
only if 

..( 8 ) 


dt ' 


dt ' 


But- thiR means that is a symmetric tensor, for if the component 
form of the condition, vis. 


*-Tr^ 

dE m 
• dt ‘ 
3 E, 

' dt 


*17 


+ K 13 E, 

- + ^saC'i, 


.. dE. 


“31 


+ V. 3 -# + + K S3 E t 
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TC P i v p dE x , rr p d E x • 

K n E a -gj + K lt E v + E 1S L, -jj- 

, dE„ , dE„ 

•TT 

? 2 E m k „ 3 E t „ „ dE t 
J *~di + KziEy It + KszE, ~dt 


(S') 


is to be valid for all values of E x , E v and E„ we must have 

K a ^Ku (9) 

In this case the connexion between D and E may be visualized by 
means of the tensor ellipsoid. If we put E = r* and draw the surface, 

Dp* — K-^x 2, -f- 2K 1 ^poy -f- K^ 3 -f- 2^ 23 yz -f- E as z s -j- 2K SL zx — 1, (10) 


then, according to p. 37, the direction of D is given by the normals to 
the surface, while its magnitude is the reciprocal of the projection of 
r on the normal. In what follows we shall employ only the co-ordinate 
system of the principal axes, in which system the equation of the 
ellipsoid is 

K lX '» + + K ul z'* = 1. . . . (11) 

K x , E n , K m are called the principal dielectric constants of the crystal. 
The axes of the ellipsoid are proportional to the quantities 
1 1 1 

Vk x VEZ Vk^' 

Now, in an isotropic body, the velocity of light is c/VK. Similarly, 
we call the quantities 

c c c 

vw; vk vk^ 

the principal velocities of light in the crystal. The axes of this so- 
called Fresnel ellipsoid are thus proportional to the principal veloci- 
ties of light. In the system of the principal axes we have the simple 
relationships : 

D/ = EjE^y Dy. = E a Ey, Dy — . (12) 


However, since K x , K n and K m are unequal, the resulting vector D will 
no longer be parallel to E. 


* Of coarse, this relationship mast contain some dimensional factor, since we 
cannot set an electrical field strength eqaal to a length. Suoh a factor, of magnitude 
unity, is assumed to be oodtained in this equation. 
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In this same co-ordinate system we also have, by equa- 
tion (12): 

E V ^±D W , . (13) 


i.e. the principal axes of the Fresnel ellipsoid are also the principal 
axes of that tensor ellipsoid whioh, reciprocally, gives E as a function 
of D. Its equation is 


K 


(14) 


Thus the axes are proportional to the square roots of the principal 
dielectric constants — that is, the principal indices of refraction. The 
figure is thus called the index ellipsoid. The corresponding tensor is 
denoted by ® _1 , and so 

E = 0- l B (15) 


2. Plane Electromagnetic Waves in Anisotropic Media. 

We assume the electrio vector to be representable by a plane wave 
whose planes of equal phase are taken normal to the unit vector n. 
Let the phase be propagated in the direction of n with the velocity v. 
Thus we have 

E = E 0 e“(‘“7). (16) 

We then have, from the second field equation, 

— ccurlEoe^O"?)^ c[eo, grad^O"")]. . (17) 

Since grad m — n, we obtain after integration with respect to t* 


H = ?e i “( , "7)[«E 0 ]= Hoe'-O"?), . . (18) 

and Ho*J[fdy. ...... (19) 


Thus the magnetic vector is normal to the plane of E and n. 

• For if we put ra - u, grad u becomes equal to a and 

grad j-O-ulv) - - ~ grad «, 

i.e. DB» grad tb (*-»/»)] [aBJ. 
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However, we do not yet have any information regarding the angle 
between E and n. From the first field equation, 

=3 c curl H — c curl Hoe" ('"e) = — e“ 0“ v) [«H C ], (20) 

Integrating, and inserting (19), we obtain 

R — [«[wE 0 ]] - D 0 *r(‘~*). . '(31) 

This signifies that D is normal to n. With respect to D and H we thus 
have pure transverse waves. From (21) we obtain an equation which 
is of fundamental importance in what follows: * 

D — ^ (E - n • «E), | 

2 ( 22 ) 

or ^D-E+»'»E = 0 

c a J 

This equation states that D, E and n 
lie in the same plane — H being normal 
to this plane, as we saw above. But 
the vector 

S=^[EH] 

also lies in this plane;., moreover, S 
forms the same angle 8 with n as does 
E with B (of. fig. 1). S gives the direc- 
tion of energy flow; this means that a 
bounded portion of the wave front 
travels in this direction while its normal 
continues to have the direction of n. 

The phenomenon is to be compared Fig. i 

with a company of soldiers executing a 

“ right oblique”. As may be seen directly from the figure, the velocity 
of the energy flow is 



V is also called the ray velocity. 

If we decompose (22) into components referred to the prin- 
cipal system of axes, with which we suppose n to make angles 

* The rubncript 0 ia now omitted, since the equation is true for the instantaneous 
ralueg ®, EC and 0 an well as for the amplitudes E., H„ end D* 




XIX.] ANISOTROPIC MEDIA 365 

a, /?, y, and if we express E in terms of D in the second term, 
then 


( 4 - 5 )*- 


i> — n3oos y 


The scalar product D n vanishes, since D is perpendicular to n- If we 
express this product in terms of the components given by (24), and 
if we divide by the non- vanishing product E«, we obtain the relation- 
ship 

COS 2 a COS 2 ^3 , cos 2 y _ 

rr*? xr? + zr? • ■ • m 

K~c* K n c 2 K m c* 

This is a quadratic equation for v z in terms of the components of n ; 
hence to every wave normal there correspond, in general, two phase 
velocities. For the direction of the *-axis, for example, we have 
SOS a = 1, cos J3 — • ■ cos y = 0. Hence 

* ~K n “ d * =TQ 

We may therefore replace cj\ r E l , ... by the principal velocities 
%, % and v in , and thus obtain the more elegant form 


If we know the values of v corresponding to a given direction — call 
them */ and v" — we obtain the directions of the corresponding vectors 
-*?' or 15" by substituting v‘ or v” in the following (equations which 
come from 124): 

. ( 26 ) 

* vf — v 2 V-J — V* v m *-+v* 


These two directions are mutually perpendicular, for we have 



D'D" = («E')(«E")o 4 
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ooa 2 a , cos 2 ft 


( »/ — v' 2 ) («, 2 — 1>" 2 ) ^ (v n 2 -t>' 2 )(t> n 2 “«" 2 ) 


cos 2 y 


Now we also have the identity 


-«' 2 )( Vni 2 -o" 2 )/- 


(t^-O" 2 ) COS 2 g 


: u'2 «,* _ t,"* - (V _ */2) («* _ „"*)’ 


_ c4 f cos 2 


t/ 2 «, 2 - «" 2 + o u 2 - »' 2 V - «"* 


COS 2 ft 


(27) 

(28) 

cos 2 ft 


cos 2 y 


COS 2 y 


(27') 


But since «' and e" are the roots of equation (25'), the sums of the 
positive and negative terms vanish separately which proves our state- 
ment. 

T'.a direction of D may be determined geometrically, using the index ellipsoid: 
Cut the ellipsoid by a plane normal to n. The principal axes of the resulting ellip- 
tical section give the directions of D' and D". Proof: The principal axes are 
characterized by the fact that r 2 is an extremum for these directions, with the 
auxiliary condition that the terminus of r is in the plane rtt — 0 as well as on 


the ellipsoid r<D _1 r ” 1. The extremal condition gives 

r8r = 0 (20) 

and the variations of the auxiliary conditions are 

ttSr = 0 and SrG^r =■ 0 * (30) 

Introducing the Lograngiau multipliers X and p, we obtain the equation 

r+ls + |irr = 0. ....... (31) 


Scalar multiplication by r and also by n yields the following equations, since 
rn — 0 and r®~ l r «= 1: 

X «= riwD-V, 

whence fl + w . n® ~*r — ®~'-r =0. ...... (32) 

Sinoe r = t)/p in this index ellipsoid — -p being a factor of proportionality — we 

obtain f 

^D-<h-^D+ h.m<X>- 1 D = ^D- E+ m . «E — 0. . (33) 

* For according to the way the index ellipsoid was introduced, E - ®~*(r) has the 
direction of the normal., and the condition that the variation Sr remains on the ellip- 
soid means that Sr rnclst be perpendicular to E. 

f The introduction af the factor p is neoessary when comparing equation (33) with 
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Identifying p x !& with t^/o*. we see that this im precisely equation (22) (p. 364). 
Hence the construction is justified. 

Having thus found the two vectors D' and D" corresponding to 
a normal n, the directions of the associated vectors E' arid E" are 
given as those of the normals to the ellipsoid through the termini of 
D' and D". These are, in general, not mutually perpendicular. The 
vectors H' and H" are normal to the planes of D' and n and D" and n 
respectively. There are two rays corresponding to each wave-normal, 
viz.: 

g' = A [e'h'] and S" = ± [E"EC"]. 

We designate the corresponding unit vectors by &*' and s”. 

While the direction of n leaves am ambiguity, that of D determines 
n as well as E and S, and the magnitude of v uniquely; for n, on 
one hand, must be in the plane normal to D, and, in addition, the 
vector E is uniquely determined for a given D by the index ellipsoid. 
Since n must lie also in the plane of D and E, the intersection of the 
two planes gives the direction of ft. The magnitude of v is obtained 
most simply from equation (22) (p. 864): If we multiply this equation 
scalarly by D, we have 

v — 

On the other hand, if we start with the direction « of the ray, we 
obtain completely analogous relationships in which D and E have 
interchanged their rdles. The fact that D, E and s lie in one plane 
may be expressed by the equation (of. Ex. 2, p. 10): 

D+oE-f j&’=0. (35) 

The coefficients a and are obtained by scalar multiplication of this 
equation by $ and by ft in turn: Since « is normal to E, multiplication 
by « gives 

B = — Ds, . 

and since ft is normal to D, multiplication by ft yields, after inserting 
the value of /3, 

( Ds) ( sn ) 

E n ' 

But from equation (22) (p. 364) we have 

Ds = ~ ^ ( E#4 )> 

equation (22), lor otherwise if we put r - D, comparison of these equations yields the 
meaningless result D* - c*/t A 


cVed Vdo _1 d 

W' ===C ~W"' 


■ (34) 
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so that 

C* , 0 s . a C* 

a=-~ 2 (ns)*-~~- 2 co S *d~-~ w 

Then we have 

pE-B + s.sD=0 (36) 

Comparison of this equation with (22) shows that D and E are inter- 
changed, and that d*/y a appears in place of v 2 /c*. 

If we denov.e the components of s — i.e. the direction cosines of the 
ray — by cos A, cos B, cos F, and express the first D in (36) in terms of 
E, we obtain the component equations 

Ds cos A „ Bs cos B „ CscosT 

E m = T , E v = *,= -j. (37) 

— fa *B — fs A ®t“.p2 

Using these components to express the vanishing of the scalar pro- 
duct Es and replacing K Jt ... by c 8 /?, 8 , . „ . , we obtain the 
equation 

tV 8 cos 2 A v n * cos 2 B t) m 2 cos 8 V _ 

v* - F 8 % 8 - F 8 + V* - F 8 * * } 

This is a quadratic equation for F 8 as a function of the direction of 
the ray. The directions of the corresponding S' and ®" are obtained 
as for D' and D" above, either analytically from equation (37) or geo- 
metrically as the axes of the ellipse cut from the Fresnel ellipsoid by 
a plane through the centre and normal to -s. The corresponding vectors 
D' and 3D", which are not mutually perpendicular, are then found 
from the ellipsoid in the usual, way. As demonstrated for D above, it 
also may be shown that E uniquely determines the wave. 

3. Normal Surface and Wave Surface. The 'Optical Axes. 

We now consider an optical disturbance spreading outwards from 
a point. This point need not be an actual source of light; but, as we 
shall see in the next chapter, may be any point of a wave front. We 
shall find that under certain conditions the so-called Principle of 
Huygens permits us to consider any such point as a new source of 
light. We may represent the light emanating from 0 in such way that 
a surface element — a portion of a plane wave front — travels in each 
direction. After one second, these wave fronts have travelled a dis- 
tance v in the direction of t*heir normals. If we measure off the corre- 
sponding segments OP ■— r — v on these normals, the end points 
form a surface called the normal surface. However, each of the actual 
elementary wave fronts does not lie on this a*irface, but somewhere 
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to the side of the corresponding point P (see fig. 2). Moreover, all 
elementary wave fronts emanating from 0 lie on a new surface called 
the ray surface or wave surface. First let us consider the normal sur- 
face. We obtain its equation by putting r in place of v and x/r, . . . 
in place of cos a, ... in equation (25') (p. 363): 

(v a 2 - r 2 ) (v m 2 — r 2 )x? + (v m 2 - r 2 ) (v 2 - r*)y 2 

+ W - **) (V — r 5 )* 2 = 0. . . (39) 

This is a surface of the sixth degree 
and consists of two sheets, corre- 
sponding to the fact that there are 
two phase velocities for every direc- 
tion. We wish to consider the curves 
of intersection of this surface with 
the co-ordinate planes, which — as 
planes of symmetry of the tensor 
ellipsoid — are of particular impor- 
tance. The intersection with the 
plane y = 0, for example, yields 
(«„* - r 2 ) (v m 2 - r 2 )x 2 
+ (V - r 2 ) (V ~r 2 )z 2 = 0. (39') 

This curve consists of the circle r 2 — 

»„ 2 and the oval of fourth degree 
(v m 2 — r 2 )^ 2 + (v 2 - r 2 ) z 2 => 0. If 
we now assume that «i < t> n < we see that the circle and the oval 
have real points of intersection in the plane y — 0, and only in this 
plane; for if we put r = v a in the equation of the oval, the coefficients 
of ac 2 and z 2 have different signs, and so the equation can be satisfied 
by real values of x and z. The directions of the lines drawn from the 
centre to these points of intersection are given by f 



These directions are called the optical axes, since in these directions 
there is but one phase velocity v. The wave normals coincide with one 
of the axes, and we take the one that goes, with the positive sign of the 
root. The direction cosines of the normals are 



a - v(i T ^r V^I_ cos ^ 0 '] 


Setting these values into (26') we get the single value of the phase 
velocity v' = v" — v a . But then the Y -component of D is indeter- 
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nunate, since cos as well as (1 /K n ) — (v^/c 2 ) vanish in (24). Foi 
DJD X we get, as we must, —cot a, since D is always normal to n. 
Whatever value we assign to the Y -component, D remains perpen- 
dicular to n, and so D may assume any direction in the plane normal 
to n. 

On the other hand, the geometric construction must always furnish 
two axes of the ellipse of intersection. The contradiction is only an 
apparent one, the explanation being that the elliptical seotions are 
circles in this case. Hence the optical axes are normal to those sections 
of the index ellipsoid which are circles. 

Since the axial direction lies in the ax-plane, the y-axis is one of 
the diameters of these circular sections. This is important in what 
follows. But since a circle ha3 no uniquely determined axes, the direc- 
tion of D is undetermined. Thus the associated ray is also undeter- 
mined. It is not correct to conclude, however, that the various possible 
directions for the rays are obtained by rotating the entire vector 
system of fig. 1 (p. 364) about n, for the angle between D and E — 
which is the same as that between n and s— depends entirely upon 
the position of D. Moreover, n is not the axis^pf the cone containing 
the several rays, but is itself one of the generators; since the y-axis 
is a diameter of a circular section, it is also a permissible direction for 
D, but D and E have the same direction in this instance, so that s 
and n coincide. 

If the magnitude of v m approaches that of v w the angle between 
the two axes becomes smaller and smaller; in the limit v m ~ v u , both 
axes coincid| with the 2 -axis, and the circle touches the oval at the end 
points of the axes which coincide. This is the case for a so-called 
optically < miaoekd crystal. Here the Fresnel ellipsoid and the index 
ellipsoid are figures of rotation and, as seen from equation (39) above, 
the normal surface degenerates into the sphere r* = and the fourth- 
order surface of rotation 

(V - r*)3* + («* - r«) (y* + z*) = 0. . . (39") 

Of greater importance than the normal surface is the ray surface or 
warn surface, obtained by laying off the ray velocity along the direc- 
tion of the ray. The elements of the actual wave front are at these 
points, and are given by the tangent planes to this surface. The feet 
of perpendiculars drawn to these planes from the origin lie in the 
no rma l surface, which is therefore a so-called pedal-surface of the wave 
surface. The equation of the wave surface is obtained by putting 
r for V and xft ... for cos A ... in equation (38) (p. 368) : 

fhW - »*) M - r 8 )* 2 + V M - O W - r*)y* 

+ %,W - r 2 ) (v n 2 = 0. . 


- (42) 
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The term of lowest degree is rWV -f- y 2 + z 8 ). Since all other 
terms contain r 8 , we may divide throughout by r 2 , and the degree of 
the ray surface is not six but four. Correspondingly, the curves of 
intersection with the co-ordinate planes are circles and ellipses (cf. fig. 3, 
Plate I, facing p. 240). There is again a plane in which the circle and 
ellipse have real points of intersection. The lines joining the opposite 
pairs of these points are called the optical ray axes . If the ray has 
the direction of one of the optical ray axes, there are infinitely many 
normal directions corresponding to one ray, and these normals form 
the surface of a cone (see below). For uniaxial crystals the wave 
surface degenerates into the sphere r 2 == v c * and the prolate ellipsoid 
of rotation 




y 2 4- a 2 


1 = 0 . ' 


(43) 


4. Refraction of Plane Waves at the Plane Surface of an Aniso- 
tropic Medium. 

Since the continuity of the tangential components of E and of H 
(p. 346) must hold for anisotropic media also, we can make use of 
the considerations of § 3, Chap. 

XVIII (p. 349), from which we 
obtain the laws of reflection and 
refraction. It must be remem- 
bered, however, that n denotes 
the direction of the wave nor- 
mal, as may be seen from equa- 
tion (16) (p. 363). Hence, at the 
boundary of an anisotropic body 
the wave normal remains in the 
plane of incidence and obeys 
the Law of Refraction. But the 
recognition of this fact is not of 
much aid in constructing the 
refracted wave normal, for, in 
contrast with isotropic media, 
the value of the index of refraction now depends upon the direc- 
tion of this normal. This means that the direction of the refracted 
normal must be known in advance, in order to find the correct value 
of the refractive index. Fortunately, there is available a method which 
leads to the desired result. This method is often used to furnish a simple 
derivation of Snell’s law for isotropic media, and is based upon 
Huygens’ Principle. According to this principle, which will be dis- 
cussed in detail in the next chapter, each point on a wave front may 
be taken to be the source of secondary wavelets: the actual vave 
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front at any future time may be considered to be tbe envelope of tbe 
secondary waves. Let us apply this idea to a plane wave incident upon 
a plane surface of a crystal (fig. 3). We must then consider the secon- 
dary waves emanating from points in the surface, at the same time 
remembering that the arrival of the primary disturbance takes place 
at different times, since points in the same phase are on the line AB. 
By the time the incident wave has arrived at 0, the idsturbance 
coming from A has already spread out over a wave surface. Assuming 
the upper medium to be a vacuum, this corresponds to a time BGjo. 
The wave surfaces belonging to points lying between A and 0 should 
be drawn in corresponding ratio, but since the envelope of these wave 
surfaces is a plane passing through G, it is sufficient to take the plane 
through C tangent to the wave surface of A as, the new wave front. 
Since the wave front is of indefinite extent in a direction normal to 
the plane of the figure, we really mean the plane containing the line 
through G normal to the diagram and tangent to the family of wave 
surfaces originating in points on a line through A normal to the figure. 



Imagine a ray determined by a small surface element in the neigh- 
bourhood of A. After refraction, this element of the wave front must 
contain the point of tangency of the wave front with the wave sur- 
face corresponding to the point A. What happens for one wave sur- 
face in a plane normal to that of fig. 3 is shown in fig: 4. The top line 
in the latter diagram projects as the point A in the former figure. 
The two points of contact — one corresponding to each sheet — do not, 
in general, lie in the plane of fig. 4. Hence a refracted ray does not, in 
general, follow the law of refraction. 

Accordin g to what we have just seen, it is important to differentiate 
between refraction of the normal and refraction of a ray, i.e a limited 
portion of a wave front. This difference becomes evident if we con- 
sider the simplest instance of the passage of light through a plane 
parallel plate of anisotropic material. If a plane wave is incident 
normally upon such a plate, the normal will not be deviated; never- 
theless, there are two wave planes within the crystal perpendicular 
to the normal direction. These correspond to the two velocities of 
propagation belonging to each direction. If a plane parallel crystal 
plate is inserted in the parallel bundle of rays between the collimator 
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and telescope of a spectroscope, no double refraction is observed. 
The effect is quite different, however, if a narrow bundle of rays is 
used in place of a wide beam; in this instance double refraction takes 
place, for to each normal direction there correspond, in general, two 
rays. The results aTe particularly remarkable when the wave normal 
is in the direction of one of the optic axes of a biaxial crystal. In this 
case there are infinitely many rays corresponding to the normal, for 
it is possible to draw a tangent plane to the two sheets of the ray 
surface such that the plane touches the surface on a circle (a portion 
of this circle is seen in fig. 3, Plate I). Hence any ray from the 
centre which passes through this circle corresponds to this plane 
wave front, and the transmitted rays form the surface of a cone, 
which in turn is refracted into the form of a hollow cylinder of light 
upon emerging. This phenomenon is known as internal conical refraction. 

The effect reciprocal to double refraction of a ray is obtained by 
allowing light to enter a biaxial crystal in all directions through a 
given point on the surface of the plate. This may be done by diffuse 
illumination of a pinhole in a piece of cardboard placed on the plate. 
A given ray may be isolated by placing a similar diaphragm on the 
opposite side of the plate. This ray, however, will experience double 
refraction on emerging, since there are two different corresponding 
normal directions, each having its own value of the refractive index. 
If the transmitted ray is- allowed to have the direction of one of the 
optical ray axes, there will be an infinity of normals corresponding to 
this co-called single-ray direction. This ray is directed from the centre 
to the “ dimple ” in the wave surface (fig. 3, Plate I), at which point 
it is possible to draw an infinite number of planes tangent to the 
surface. The emerging rays fill a bone, and this phenomenon is known 
as external conical refraction. These conical refraction effects were 
predicted hy Hamilton and were verified by Lloyd, using a crystal of 
aragonite. 

In the previous discussion we tacitly assumed the incident light 
to be unpolarized, i.e. there was no uniquely characterized plane of 
vibration of E. Let us return to a consideration of the normal 
incidence of a plane- wave upon a plane parallel plate. There are 
two mutually perpendicular directions of D; since these are both 
normal to n they lie in the surface of the plate. As before, we 
denote these two vectors by D' and D". D and ft also determine 
the plane of vibration of H (cf. fig. 1, p. 364). If, now, the incident 
light is linearly polarized, with its magnetic vector forming an angle u 
with one of the vectors D, then we can resolve the magnetic vector 
into two components, one along D' and one along D". But each 
of the components has its own phase velocity, corresponding to 
the two different velocities belonging to n. As a result, there is 
a difference in phase when the light emerges — the original linearly 
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polarized light has become elliptically polarized. On the other 
hand, by- introducing a phase difference in this manner, we can 
convert elliptically polarized light into plane polarized light. This 
is the principle of so-called compensators for polarized light. 

Finally, we wish to consider briefly the beautiful patterns shown 
by plates of anisotropic materials when placed between crossed Niool 
prisms. If we allow polarized light to fall upon a crystal plate at angle 
of incidence <j>, then within the crystal there are two distinct normal 
directions, each with one plane of vibration of D, and hence of 
E.* In general, the plane of vibration of the incident light will not 
coincide with either of these directions, and so there will be a component 



along each. Both wave normals again have the same direction upon 
emerging, since we can apply the same construction here as wo did 
at the point of entry into the crystal (reversibility of the rays). But 
since the two waves have travelled over different paths within the 
crystal they will differ in phase; this means that the emergent light is, 
in general, elliptically polarized (cf. the special case of normal incidence 
treated above). From fig. 8 we see that the phase difference is 
given by 

, (AB.BB ag\ 

Denoting the angles between the normal to the surface and the re- 


* It would be wrong to assume that there are two planes of vibration corresponding 
to each of the refracted normals. As a matter of faot, the construction of the refracted 
wave front shows that corresponding to each of the two normals there is a definite 
normal velocity v, and hence by equation (26) (p. 366), a definite direction of D. 



XIX.] 


ANISOTROPIC MEDIA 


375 


fracted wave normals by tp' and ip" respectively, and calling the thick- 
ness of the plate d, we. have 

AB — AC — — BD = BC sin <A = d (tan \h” — tan Ji') sin A, 

COSi/i coss/t r r T 

ie, 

g _ ^ | /sin^i" sin<^ 1\ 1 / sin sin 

l\ c V'/coafi' V e 

Since the law of refraction holdB for the wave normal, 

sin^ airup" sin^' 

~c ¥' T~’ 

* , fcosi/f' cos ib") .... 

or $= cod | — -A- ^f- l (44) 

and for the common cs&e of approximately normal incidence, 

< M '> 

If the phase difference is 2 mr the emergent wave is again linearly 
polarized in the original plane, and can be extinguished by a second 
Nicol prism turned at an angle of rr/2 with respect to the polarizer. 
If we allow convergent polarized light to fall upon the crystal, there 
will be darkness for all directions <p for which the angles of refraction 
ip' and tp" yield 8 = 2 mr according to (44). These directions then give 
dark curves in the focal plane of a lens. But these. are not the only 
dark curves. There is a second possibility: for a given angle of inci- 
dence the plane of vibration of the incident wave can coincide with 
one of the possible planes of vibration in the crystal. In this case 
there is no second wave to bring about elliptic polarization. The 
resulting dark curve is called an isogyre. 

Ex. 98. A plate of a uniaxial crystal is cut perpendicular to the axis. Show 
that with crossed Nicola a system of oircular nngs can be obtained, with a rect- 
angular cross through their centre. 




CHAPTER XX 

Electromagnetic Waves. IV: The Theort oe Diffraction 

1. The General Diffraction Problem and Attempts to solve it. Xirch- 
hoff’s Formula. 

In previous sections we have dealt almost always with wave fronts 
of unlimited extent. In those cases where it proved necessary to 
consider limited beams, in the study of crystal optics, for example, 
we assumed that the lirpited beam was propagated in the same way 
as an extended one, and that the cross-section of the beam remained 
as determined by the aperture through which the light passed . Actually, 
this is not strictly true. Simple experiments show that light enters 
that portion of space which, according to our previous assumptions, 
could not be reached by the light wave — the region of the so-called 
geometric shadow. This phenomenon is called diffraction. 

Mathematically, the problem of the diffraction of a wave by an 
obstacle is to be formulated in this way: we seek a solution of the 
wave equation for E and H which satisfies the differential equation 
both for the region within the diffracting body, or obstacle, and for 
the rest of space; and which, moreover, satisfies the condition of con- 
tinuity of the tangential components of E and H at the bounding 
surface. At the same time it must be remembered that the electrical 
constants (K, jj,, a ) are different for the two regions. If it were possible 
to find such a solution, it would be expected to furnish information 
on all questions of intensity, polarization, &c., behind the obstacle. 
However, the mathematical difficulties connected with such a solution 
are so great that it has been possible, so far, to develop solutions only 
in especially simple cases, such as the diffraction of a plane wave 
meeting a sphere or a cylinder or a screen. 

For the greater number of practically important diffraction prob- 
lems we have to fall back upon an approximate method, which proves 
adequate if the conditions are not too unfavourable. Essentially, 
this method is merely the mathematical formulation of the principle 
that every point of a wave may be considered to be the source of 
secondary spherical waves. This Goncept we have already utilized in 
the preceding chapter. According to the original Huygens formu- 
lation, the wave front existing after a certain time is given by the 
m 
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surface which envelops the elementary wavelets. We made use of 
the principle in this simple form in our discussion of the refraction of 
light by crystals (p. 371). Fresnel, however, improved the method in 
this way: m order to find the resultant disturbance at a point of obser- 
vation P, he imagined the secondary waves arriving there at a certain 
instant to be superimposed, at the same time making allowance for 
their difference in phase. The mathematical statement of this idea is 
contained in a formula due to Kixchhoff, which we shall now prove. 

We represent the light disturbance by a scalar quantity U) this 
may be interpreted as one of the two components of E, which are 
perpendicular to the wave normal. The only essential fact is that the 
intensity of the light is proportional to U s . U must satisfy the wave 
equation 

A „ n- dm 

AU ^=Ji ...... ( 1 ) 


For the dependence upon time we make the usual assumption for a 


monochromatic vibration 

U — u(x, y, z)e*“V ...... (2) 

and obtain tbe following differential equation for the spatial part, 
which alone interests us: 

A«+**u=0. ....... (3) 

Here (4) 


Assume now that we have two functions, u and v, which satisfy the 
wave equation. We then apply Gauss’s theorem to the vector quantity 
u grad® — v gradw, integrating over an arbitrary volume in which the 
integrand is continuous. The result is Green’s theorem (p. 270): 

j) (u grad® — v gradw) (2 S — J(uAv — r&u)dr 

= fk 2 (vu-uv)d-r=0. (B) 

We take the function v to be the spherical wave originating at a point 
P in the region of integration. Apart from the time factor, this is 



But this function becomes infinite at P; hence if we wish to apply 
equation (5) we must surround this point by a small sphere. The sur- 
face of the sphere is then part of the bounding surface of the region 
of integration, and the point P is thus excluded fiom this space. The 
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normal is always directed outward from tie region of integration; 
hence in this case the normal is directed toward the centre of the small 
sphere (fig. 1). We now have the equation 

grad^— e —— grad iij dS 

+ iph,™ ( u ^ ~ ~V d «) ds =* °- • ( 7 ) 

The integral over the surface of the small sphere may be calculated 
readily. First we note that 
since u and grad u are con- 
tinuous, the contribution of 
the term 



vanishes when we allow the 
sphere to become infinitely 
small, sinoe dS decreases as 
r a , while we have only the 
first power of r in the de- 
nominator. Again, since w is continuous, we may replace it by the 
value u, at the centre when integrating over the small sphere. Finally, 
we have 

, e~**' /I . ik\ 

where r 0 is a unit vector directed outwards from the centre of the 
sphere. Hence we have 

dS = dS = — r$r 2 sin 6 d8 d<f>, 
and the value of the integral 

u, (f grad - — dS 

•^Sphere r 

in the limit (r = 0) becomes 4 iru r . Hence 

Up — ^~(f) g ra< l « — « grad ^ dS. 


(8) 


(9) 


( 10 ) 


This is the celebrated formula of KixchhofE.* It expresses the disturbance 
at a point P as the integral of the light disturbance over the boundary 
of a region containing P. If, for example, we are dealing with the 

1 *The formula wae first proved by Helmholtz, but is a special case of a formula 
due to Kirohhoff. 
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diffraction of light by a small aperture in a screen, the screen would 
be taken aa the boundary of the region. But in order to apply the 
formula, the diffraction problem must, in a way, be solved already, for 
one must know the disturbance in the opening and on the surface of 
the screen in. advance. KirchhofE made the natural assumption — 
which is amply justified by experience — that the disturbance in the 
free aperture is that which would exist if the screen were not present, 
and that u and gradw are aero on the screen itself. It is evident that 
with an assumption of this kind, which contains nothing concerning 
the material properties of the screen, we cannot attack the more 
delicate questions, such as that of polarisation. Nevertheless, if we 
do not require the distribution of light in the immediate neighbourhood 
of the edge, the Kirchho.fi assumptions prove adequate. 

2. Reciprocal Theorems of the Theory of Diffraction. Classification of 
Diffraction Phenomena. • 

We shall now apply the Kirchhofi formula to the special case where 
the primary excitation is from a point source Q and is represented by 


p* 1 » - - (j} eJU ' + *•"*'') 


where r, is the radius vector drawn from Q .,o a point of the aperture 
and r is the radius vector drawn from the point of observation P. 
The corresponding unit vectors are designated by the subscript 0 
(fig. 1). Since dS — coe{nr,)dS, we have 




We now assume that the distances from the source to the aperture 
and from the point of observation to the aperture are large compared 
with the wave-length. This condition is fulfilled in all cases occurring 
in practice. We may therefore neglect 1 Jr and 1/r, when compared 
with Js — 2 w/A, and so obtain 


£\P~ 7r~ ^ C0S ( nr ) ~ 008 ( nr J) dS - 
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Assume further that both r and r e are large compared with the dimen- 
sions of the aperture— a condition which is also realized in all practical 
cases. This permits us to look upon r and r a as constants where they 
occur in the denominator and in the trigonometric functions of the 
integrand; however* this is not allowable where these quantities occur 
in the exponent, since they are multiplied by the large factor k — 2w/A. 
Bringing the* constant values outside the sign of integration, we obtain 
the simpler formula 

«, » fe <H %?* . (15) 

Formulae (13) to (15) are entirely symmetrical in the co-ordinates 
of light source and point of observation — interchanging these quanti- 
ties merely alters the tlgebraic sign. This means that if a source Q 
causes a certain intensity at a point P, the identical source placed 
at P would cause this same intensity at the point Q. However, this 
must not be interpreted as a reversibility of the light path for dif- 
fraction in the sense that the use, as a source, of a distribution of 
light corresponding to the diffraction pattern of a point source would, 
in turn, have a single point of light as a diffraction pattern, i.e the 
aperture would act as a lens. The theorem says nothing regarding the 
intensity in the neighbourhood of Q. 

A second very remarkable theorem was discovered by Babinet. 
Imagine a number of apertures of arbitrary shape out from an opaque 
screen. Let these pieces which were cut out be placed in a large open- 
ing, this system then being considered a second screen. Two such 
screens, for which the openings in one are the same as the opaque 
portions of the other, are said to be complementary. Babinet’s theorem 
' states that the diffraction patterns produced by complementary 
screens are identical, except for the central spot corresponding to the 
direct light from the source. To demonstrate this we may imagine 
an image of the source formed by a lens of large aperture. In the 
limiting case of full aperture there would be no light outside the geo- 
metric image. Hence at a point P in the neighbourhood of this image 
the disturbance u would be zero. We now insert one of the screens 
between source and image. Let this cause the disturbance % at P. 
Let the disturbance produced by the complementary screen alone be 
m*. For no screen at all, the disturbance at P — which is zero — would 
be given by the sum of the integrals over both complementary screens. 
Hence by KirchhofFs formula, 

0 = Uj + m 2 , 

and == M a ®, 


which proves the theorem. 


( 16 ) 
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Consider now a plane screen containing one aperture (fig. 2). Draw 
a rectangular system of co-ordinates in the plane of the screen and 
place the origin within the aperture. Let the positive 2 -axis be normal 
to the screen and directed toward the observer. The co-ordinates of 



the source of light Q are x c , y q , z q , and those of the point of observation 
P are x, y, z. Denote the co-ordinates of an element of area of the 
aperture by $ and tj. The distances r and r e are then given by 


~(x-£)*+{y- V )* + z*, \ 

r* =(s e -£) 2 +&.-1J)* + *„*/' 


We further denote the distance from the source to the origin of co- 
ordinates by R g> that of the observer from the origin by R, and the 
direction cosines of the lines QO and OP bja^— —x q jR q , j8 0 = —yJR t 
and a = x/R, fi — y/R respectively. Expansion in powers of £ and 
r) now gives 

'+ r, = R+ %,+ £(* o- «) + 1 (A, - 0 + * + 1 

__ j Kg + fe ) 2 _ ^t+M + . 

— R 4- Rg -{- <f> (£, ij) (18) 

This representation by a power series furnishes us with a natural 
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classification of diffraction phenomena: If source and observer are 
bo far from the screen that we may neglect the terms of rf>(g, 57) which 
are of the second degree in g, ij, we speak of Fraunhofer diffraction 
phenomena; in other cases we refer to Fresnel diffraction phenomena. 

If we insert the development (18) in the Kirohhoff integral (15) 
we obtain 

«,= fe-m^dS. . (19) 

3. Fraunhofer Diffraction by a Slit and by One-, Two- and Three- 
dimensional Gratings. 

Following our classification of diffraction phenomena we imagine 
the source as well as the observer to be very far from the diffracting 
aperture, so that the waves which strike this aperture are essentially 
plane. This condition may be realized in practice by placing the source 
of light at the focus of a lens and by studying the diffraction pattern 
in the focal plane, of another lens (see Chap. XXI). 

As a first example, we study diffraction by a slit of width a which 
ie of unlimited length in the ^-direction. This means that the quan- 
tities occurring are independent of the {/-co-ordinate, so that the 
problem is a two-dimensional one in the zcc-plane. *We also write i and 9 
for the angles which an incident and a diffracted ray make with the 
normal Oz to the slit; hence a 0 = sini, and a — sin k 

The diffraction integral reduces to 

fait 

u f — const. / e -«<<*.- 

J-ajt 

= const. /*'%-**»*-*«#. . . . (20) 

'-alt 

The value of the definite integral may be written down at once, but 
an instructive way to evaluate it is to use the vector diagram, and 
proceed graphically. We divide the slit into small strips of width dg 
and add the contributions of the separate strips. While g increases' 
from —a/2 to a/2 in steps of amount dg, the phase difference increases 
in steps of k (sint — sin 8} dg. This means that the vectors representing 
two successive strips differ in direction by this amount. Since the 
contributions of all strips of equal width dg are of the same absolute 
magnitude, the series of index points form the vertices of a portion of 
a regular polygon. In the limit, where dg is infinitely small, the figure 
becomes a circular arc (of. p. 152). The figure forms a closed loop when 
the phase difference between the first and the last strip is 2n. A closed 
polygon means that the resultant disturbance is zero, i.e. there is 
complete darkness. In genoral, there is a minimum whenever 
i(sm* — sin 6) a -- 2 rm, (n = 1, 2, 3, , . .) 
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or (amt— sin#)a — nA. ..... (21) 

This is the condition for a minimum for diffraction by a single slit. 
Maxima occur between these minima at points where the resultant in 
the vector diagram is a diameter of the circle, i.e. where 

£(sint — sin0)a — (2 n -f 1 )w, (n = 1, 2, 3, . . . .) 

or where 

(sint — sinS)a — A (22) 


This, then, is the condition for directions of maximum illumination 
in the diffraction pattern of a single slit. 

For a given direction of 
incidence — which we may 
choose, for example, along 
Oz normal to the plane of 
the slit — the amplitude of 
the diffracted light will be a 
function of the difference in 
path for the two edges of the 
slit. The distribution of in- 
tensity may be found from 
(20), or from the vector dia- 
gram, by the method of p. 53. 
The result is shown in Sg. 3, where the dotted curve gives the ampli- 
tude of the resultant disturbance, while the intensity is represented 
by the curve in full line. 

If half the slit were covered there would be light in the directions 




of the previous minima when n is odd, since the second half of the 
index circle would now be missing. If we place a large number of such 
half-slits side by side we obtain a one-dimensional plane grating. let 
the distance between corresponding points in two neighbouring open- 
ings be called a. The total effect is obtained by addition of the dis- 
turbances from the individual slits. In those directions for which 

(sin* — sind) a — j»A, 


(23) 
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the individual disturbances plainly have the same phase, and hence 
there is a maximum of illumination (cf. fig. 4). This is then the con- 
dition for a maximum for the diffraction grating. 

Since, for a given direction of the incident light, the angles giving 
maximum illumination depend upon the wave-length, a grating of 
this kind is eminently suitable for the spectral analysis of fight. The 
number m is called the order of the spectrum. In what directions are 
the dark places to be found? From the vector diagram it is seen that 
the first dark spot occurs when the index points of the individual 
rulings (or slits) form a closed polygon. Thi3 is the case when the 
phase difference between the first and the last ruling is 2 w. Thus the 
rays coming from two adjacent lines of the grating must differ in phase 
by (27m -j- 2vjN) to give the dark spot in the neighbourhood of the 
maximum of order m, where N is the total number of fines in the 
grating. Hence the direction of the minimum adjacent to the mth- 
order maximum is given by 

(sin* — sin0)a= mA4- ... (24) 

This is, then, 'the condition for points of minimum illumination in the 
diffraction pattern of a grating. We have here a means of computing 
the resolving power of a grating, i.e. the smallest relative difference in 
wave-length which can be separated. Following Rayleigh, two wave- 
lengths are considered separate if the maximum of the diffraction 
pattern- of one coincides with the first minimum of the other. Let the 
wave-length A + dX have a maximum in the direction 8, i.e. 

(sin* — sin 8) a — m(A + dX). 

At the same time, suppose the first minimum of the wave-length A 
occurs at the same point: 

(sin* — sin0)a = mX -j- 
From these two relationships we obtain 
dX 1 
A ” mN' 

The resolving power ft is usually taken to be the reciprocal of this 
value, i.e. 

= A = mN . ...... (25) 

The directions of the maxima and minima obviously do not de- 
pend upon how the resultant of the disturbances from the individual 
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elements of the grating is formed. The only essential thing is that 
we have transparent or reflecting or scattering elements arranged at 
equal distances from one another. It is only the distribution of light 
among the various orders which is affected by the nature of the grating 
elements. 

We pass now to the consideration of what we may call a two- 
dimensional grating. This consists of a large number of apertures 
whose distance apart is uniformly a in the ^-direction and. h in the 
77 -direction. The conditions for maxima are obtained by the same 
construction as above, and gre 


(do — a) a — 1 

(A»~ f$)b — m 2 rt /* 


(26) 


For a given direction of the incident light, the direction of the maxi- 
mum corresponding to the order numbers and is uniquely de- 
termined, for the direction cosines a, j3 each determine a cone about 
the corresponding axis. The generators common to these two cones 
give the directions of the maxima corresponding to the orders m, 
and m 2 . Addition of a third condition — as in the case of the space 
lattice (see below) — would define a third cone about the zmxis. This 
cone, however, would not necessarily contain the common generators 
of the two other cones. 

While the two-dimensional grating has no practical application, 
the three-dimensional grating, or space lattice, is of the greatest impor- 
tance — viz. in the analysis of electromagnetic waves of very short 
wave-length (X-rays). For such short waves it is not possible to pro- 
duce appreciable diffraction angles with ordinary ruled gratings except 
by the use of special artifices. But nature has provided us with other 
gratings whose divisions are very much smaller, viz. crystals. The 
atoms in crystals are arranged at regular intervals, but form a three- 
dimensional lattice. In 1912 M. von Laue suggested the use of crystals 
as diffraction gratings and the experiment was performed successfully 
by Friedrich and Kmpping.* 

We limit our considerations to rhombic lattices, in which the lattice 
points lie in three mutually perpendicular directions at distances a, 
b and c. This, of course, includes the simpler gratings in which two or 
all three of the separations might be equal. The atoms act as sources 
of secondary waves, since they are set into vibration by the incoming 
light. Thus the secondary waves in this case have a certain reality, 
as contrasted with the Huygens wavelets in free space. Since the 
atoms are set into vibration by the incident light, they vibrate in 
phase, and the secondary waves which they produce (or scatter) are 


*M, von Lane, Phyt. Zeits. 14, p. 421 (1913). W. Friedrich, P. Knipping and SL 
70 n Laue, Le Sodium, 10, p. 47 (1913). 


! 711 ) 
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in phase, just as are ordinary Huygens wavelets. Hence the same 
treatment is applicable here. We now hape three conditions for the 
directions of maximum illumination: 


( a o — «) « = 

(A)“ 

(y 0 -- y)o = m 3 AJ 


(A- 

. , m 3 X 

(n - y) = ~ 


(27) 


(270 


or 1 



(27") 


As we saw from the cone construction for the two-dimensional grating 
(p. 385), the addition of the third condition overdetermines the prob- 
lem. This becomes evident mathematically by noticing that if we 
select Oq, A and y 0 arbitrarily, the values of a, ft, y obtained from 
(27") will not, in general, satisfy the condition a 2 ~j- A + Y z ~ 1- 
Hence, if we are to have a diffraction maximum at all, there must be 
a relationship between the direction and wave-length of the incident 
wave. If we square equations (27") and add them, we obtain the 
required relationship: 


A 


^ w, 2 m 2 2 , 7» s 2 ' 

7 ' ¥ T 7 


(28) 


A maximum is obtained only when this condition is fulfilled. We 
obtain a formula which lends itself more readily to direct interpre- 
tation if we square and add equations (27')- Denoting the angle 
between the directions of incidence and diffraction by 8 and 
remembering that 

CO80 — oo,, -f $/S 0 + yy 0 , 
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It the three order integers contain a common factor M, we place it 
outside the radical and have 


where H lt £f 2 , H s are relatively prime. 

Planes may be passed ‘through the crystal in various directions 
such that they contain a large number 
- \ of lattice points (of. fig. 5). A plane of 

Yfv \ * * this kind is known as a reflecting plane. 

|x\X \ ' A family of parallel reflecting planes is 

’ * characterized by the set of relatively prime 

^--1* integers proportional to the reciprocals of 
the intercepts on the axes, each being 

divided by the corresponding grating 

' A constant. For example, (010) denotes the 

Fig - 5 set of planes y — b, y — 2b, y — 36, &c. 

These indices, which are the H v J? a , H s , 
represent the reciprocals of the actual intercepts of that plane 
which is nearest the origin. This may be seen as follows: If the 
intercepts of one of the planes are I ls w 1 , ,n v measured in centi- 
metres, then 

g = £ : — : 

h "h. 

or after introducing a factor of proportionality t, 

7 ta tb tc 

?■“ s? 

The usual form of the equation of a plane whose axial intercepts are 
lx, m v n t is 

x y z_ xH i y H 2 zH s _ 


If this is to be a reflecting plane, the equation must be satisfied by 
integral values of the co-ordinates, i.e. by x — pa, y = qb, z = rc. 
Inserting these values in the preceding equation, we have 

pH i + 2#2 + r H e = t. 
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Since we have the sum of integers on the left, i must also be a whole 
number. Thus the smallest value of i is 1 . According to the results of 
analytic geometry, the distance from the origin of a plane whose axial 
intercepts are a ///->, bjH 2 , cjH a is 


(31) 


This is also the distance between two adjacent planes of the set.-. Thus 
we arrive at an entirely new interpretation of equation (30), There, 
after removing a common factor, the 11 ( represent the order numbers 
of the direction of diffraction. However, if we interpret the H { as the 
indices of a family of planes, we can express the radieand ic terms of 
the distance between the planes and thus obtain 

2d sing — MX (32) 


This is the condition for a maximum in the diffraction oattern of a 
space lattice, and is known as Bragg’s 
Law. The nature of these planes may 
be seen at once. It is easy to see 
that the plane half-way between the 
incident and diffracted rays (fig. 6) 
belongs to the set characterized by 
£Tj, H 2 , H a * Hence, on account of 
the equality of the two angles, we 
may look upon the diffraction process 
as reflection of the rays by the reflecting planes. The difference in 
path of the parts of a wave reflected from the first plane and the 
succeeding plane is given by (see fig. 6) 

AB + BB - AC = 2d cot 9 - cos | = 2d sin t (33) 

sin^ 2 2 :i 



Thus (32) signifies that for reflection from the individual planes the 
angle of incidence ir/2 — 9/ 2, the distance d between planes and the 
wave-length A must fulfil the condition that the difference in phase 
between the (partial rays must be 2Mn. Angles which satisfy equation 

* The equation of this plane is (s„ — p) 8 - (s - r) 8 or (s 0 — s)r - 0. If we insert 
for the components of {s 0 - s) the values given by equation (27') (p. 386) and remove 
the common faotors, we obtain 

& + M + S£ - 0 . 
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(32) are called glancing angles. In order to obtain such a “ reflection 
-at a depth ’’ with monochromatic radiation, it is necessary to rotate 
the crystal in such way that the entire range of angle between the 
reflecting planes and the incident wave normal is covered. This is the 
principle of the so-called Bragg Method.* 

Because the preceding derivation made use of a definite angle of incidence, 
it might be thought necessary to work with a parallel beam. For X-rays, such a 
beam could not be obtained by means of lenses, but solely by using two narrow 
slits. Meanwhile. Bragg and M. de Broglie discovered a focusing condition whose 
fulfilment guarantees that ev.en a diverging bundle of rays from a single slit shall 
give a sharp line for each wave-length. Thp condition is (fig. 7) that the point of 
observation P and the slit S shall be equidistant from the axis of rotation O, 



i.e. the spectrum falls on a circle of ra dius SO drawn about O. In the symmetrical 
position EK of the crystal, a given wave-length reflected at O will arrive at P; 
and for a.ny other position K'K', this same wave-length, and only this one, will 
go to the point P. If we draw a circle passing through 8, O and P, its intersection 
with K'K' will determine the point of reflection K. It is easy to see that SR and 
RP again make an angle of 0/2 with the surface of the crystal : angles SOP and 
SUP are equal, since they cut off the same are. Also, angles 8 OK and SIIO are 
equal since they are respectively the intercepting angle and the angle between 
tangent and chord corresponding to a given arc. Moreover, the latter angle is 
0/2. Then, also, angle POK is equal to angle PRK'. 

It should be noticed that the advantage of this arrangement is not in increased 
intensity but in the circumstance that various parts of the crystal co-operate, 
and so chance local defects in structure become harmless. 

Instead of turning a single crystal so that it assumes, successively, various 
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positions with respect to the rays, we can assure incidence at all glancing angles 
by using a mass of small, randomly oriented crystals. This is the powder method 
devised by Debye and Scherrer and independently by A. W. Hull.* 

Ex. 99. Give the relative intensities of the diffraction pattern^ of various orders 
for a grating consisting of alternate transparent and opaque strips of equal width. 

Ex. 100. The. Michelson Stellar Interferometer . — Two parallel slits whose dis- 
tance apart is b are cut from a screen. What is the appearance of the diffraction 
pattern if parallel rays are incident ppon the screen (at) normally, and (b) at a 
small a ngle e with the normal ? If both sets of rays are inoident at the same time, 
how does the visibility of the pattern de pend upon e and b ? 

Ex. 101. A cubical crystal of KC1 bounded by the planes (100), (010) and (001) 
is turned about an axis parallel to the line of intersection of the planes (100) and 
(010). The Ka ~ radiation of copper (A — 1641 X.U. — 1641 X 10~ u cm.) is inci- 
dent normal to this axis. Which reflecting planes with indices <12 oause 
reflection, and what are the corresponding glancing angles 8/2? The constant 
of the cubio lattice is 3-13 A.U. ~ 3T3 x 10~ s cm. 



4. Fresnel Diffraction Phenomena at a Slit and at a Circular Aperture. 

Zone Plates. 

Consider first a two-dimensional ease. Imagine the source to be 
a line of light of indefinite extent in the direction normal to the plane 
of fig. 8, so that the y-co-ordinate does not enter. Further, let the per- 
pendicular from the point of observation to the line source be normal 
to the plane of the screen, and let it pass through one edge of the slit. 
Since sin i = sin 0 — 0, i.e. a,, = a — 0, in this case, the development of 
t]) begins with 



(cf. p. 381). Replacing h by its value 2rr/X (equation (4) (p. 377), and 
supposing the slit to extend from £ — 0 to £ — a, we obtain for the 
disturbance at P the expression 

u (R+R*) re-W-Mix+VXJdg. . (34) 

XRR„ Jo 

• See G. I.. Clark, Applied X-Rays, 3rd Edition, p. 267 et tea., New York, McGraW' 
Hill (1940). 
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If we evaluate the integral as before, by summing the “ vectors ” of 
individual elements of the slit, we see that in contrast with the case 
treated in § 3 (p. 382), the vectors of the several elements d£ no longer 
make equal angles with one another; on account of the quadratic 



Fig. » 


term the angle between two consecutive vectors increases as 
i.e. the curvature of the vector curve increases with the length of arc. 
A curve of this nature has a spiral form. In order to obtain an integral 
independent of the special values of A, R, R Q we make the substitution 

4== 4 = 0 Vx(i + i) * (35) 

and obtain 

g 1 ^ (36) 

Va \r + rJ ° 

The curve 

Jo 


whose parametric equations in rectangular co-ordinates are 

x = f 0 Acoa (l s *) ds ’ “(?•')* • • < 37 ) 


is called the Comu Spiral (fig. 9). The numbers marked on the curve 
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are the values of A. Each number also gives the length of the corre- 
sponding arc of the curve measured from 0; this is seen at once, 
because we have dxjdA — cos trA a /2, and dyjdA = sin ttA 2 /2, so 
that (dxjdA)* + ( dyjdA )* = 1 or dsjdA — 1, where s is the length of 
the arc. 

Once we have drawn this spiral we can employ it to answer a, number 
of questions. 

First we notice that the radius vector from 0 to the point A of 
the spiral, whose relation to the slit width a is given by (35), gives 
the disturbance at P as a function of the slit width (with the sign of 
the imaginary part changed). As the figure shows, this radius vector 
passes through infinitely many turning values, whose difference be- 
comes smaller and smaller, the value approaching that of the asymp- 
totic point I when the slit width becomes infinite, i.e. the slit becomes 
a single diffracting edge. If, now, the half of the slit symmetric to the 
above is opened also, an equally great light disturbance must be added. 
For convenience we draw the spiral corresponding to the other half 
so that it is centrally symmetric about 0 with respect to the original 
spiral. It must be remembered, however, that for the second spiral 
the radius vectors are to be drawn from points on the curve to the 
origin. We are now in a position to give the value of the disturbance 
at P when the slit has any poi tion whatsoever on the f-axis — for 
example, when its edges are at g = a and jj=b. We need only sub- 
tract the vector corresponding to £ = b from that of £ = a, i.e. draw 
the line connecting the points on the curve which correspond to a 
and b. 

Further, we can- calculate with good approximation the distri- 
bution of light on a horizontal line through P parallel to the screen. 
If we limit our inquiry to points not too far from P we may neglect 
the small inclination of the lines QP' and we may expect the same 
disturbance at P' as would exist at P if the screen ended at 
$ — —xRJ(R -f R t ) instead of at £ — 0 (cf. fig. 8). With x = PP' 
increasing in negative values we traverse the points of the upper spiral, 
and the distances from the asymptotic points these points steadily 
decrease. Hence the illumination in the region of shadow declines 
uniformly. Displacing P by a positive x corresponds to withdrawing, 
the edge of the screen in the negative direction. In this case the lower 
spiral also contributes radius vectors, giving a series of maxima and 
minima in the region of illumination. The. limit for large distance from 
the geometric shadow corresponds to the line joining the two asymp- 
totio points, i.e. double the amplitude (or four times the intensify) at 
:the point P. This distribution of light corresponds well with obser- 
vation. 

Finally, we wish to investigate the distribution of light along 
a normal to the screen when the source of light is a point. 
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If we again have Oq = a = 0, the disturbance at P is given by 

«,= «-«*+*•> if . (38) 

For a circular opening (radius o) the integral can be evaluated, and we 
obtain 

«,= . (39) 

The intensity is given by the square of the modulus of the complex 
number, and this is given by multiplication by the conjugate complex 
number. Also 

(1 - e~ ix ) (1 - e+<») = 2(1 - cos a) = 4 sin 8 1. 

The result is 

'•“jars ; m 

Thus we see that for increasing size of aperture the illumination varies 
periodically. It must be borne in mind, however, that the formula is 
not to be applied to arbitrarily large apertures, since the inclination 
of the directions of r 0 and r is of importance in such cases. A maxi- 
mum of intensity occurs when 

I+i = (ta+i)A (41) 


If we compare this formula with the elementary lens formula, we see 
that the circular aperture corresponds to a convex lens of focal length 
f = a 2 /(2m -f- 1)A. However, we find no appreciable increase in in- 
tensity at the maxima here; this is first realized by diaphragming 
out those zones whose phase is such as to cause a decrease of the 
resultant amplitude. This corresponds to rendering alternate strips 
of a slit opaque, i.e. to the passage to a line grating. The division into 
zones in the present case must be such that the edges of adjacent 
zones, of which one half is covered, furnish a phase difference of 2mir. 
Thus we must have 




A _n _L 

R + R q 


Pn+l ~ Pr 


(42) 
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It is seen that this also satisfies the condition 4hat eaoh zone has the 
same area, since thiB quantity is given by ir(p n+1 2 — p n a ). The 
innermost circle, half of whose area is to be covered, has a radius 

ft — 


With a plate so divided into alternate transparent and opaque rings 
we can actually obtain a considerable intensification of light at points 
which satisfy the equation 

I . 1 2mA 

For this reason the so-called zone plate is sometimes referred to as a 
zone lens . 




CHAPTER XXI 


The Elements op Geometrical Optics and o*' 
Interference Optics 

1. The Fundamentals of Geometrical Optics. Laws of Fermat and of 
Mains. 

Geometrical optics has developed as a special field serving the 
needs of practical optics. Its purpose is to compute the details of the 
passage of light through optical instruments and to determine the 
principles of the construction of such instruments. The fundamental 
hypotheses of thie branch cf optics represent a great simplification of 
the actual facts. They arc essentially the following: 

1. Rectilinear propagation of light rays in a homogeneous medium. 

2. Independence of the several rays. 

3. Reversibility of the rays. 

i. The Law of Reflection. 

6. The Law of Refraction (Snell’s Law). 

Assumptions 1, 2 and 3 show that geometrical optics not only 
fakes no account of diffraction, but that it does not even contain 
any reference to the wave nature of light. Indeed, in the realm of 
diffraction phenomena the concept of a light ray loses all meaning; 
while it is possible to assign a ray in the direction of the normal to a 
plane wave incident on an aperture, the normals corresponding to the 
parts of the diffracted wave have very diverse directions. In the cal- 
culation of optical instruments with the aid of geometrical optics, the 
phenomena due to diffraction at the various apertures or diaphragms 
arc not taken into consideration. When needed — e.g. for the deter- 
mination of resolving power — these diffraction effects must be treated 
separately by the methods of the Theory of Diffraction. 

Two important general theorems follow from Hypotheses 1, 4 and 
5. The first of these was discovered by Fermat and is stated thus: 
The propagation of light always takes place in such way that the actual 
optical path (length of geometric path multiplied by index of refraction 
cf the medium) is an extremum (i.e. has a stationary value) compared 
with all other paths which do not follow the laws of optics, i.e. 

= Extremum. (I) 
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For media in which the index varies continuously, this becomes 

j^'n ds — Extremum (1') 

To prove this we shall show that the first variation of the optical path 
vanishes. For propagation in a homogeneous medium the law is self- 
evident. since the straight line is the shortest distance between two 
points. Again, for the case of re- 
flection, the proof may be reduced 
to this first case, since if the laws 
of geometrical optics are valid, the 
light reaching P 2 may be considered 
to come from the image P/; the 
path P x 'Pj is equal to the actual 
path and, being a straight line, is 
again an extremum. We give a 
proof also for the case or refraction. 
Let the refracting surface be the 
a$-plane of a co-ordinate system and 
let the plane of incidence be the 
yx -plane. Besides the actual path 
we consider a varied path (fig. 1) 
whose intersection with the ®y-plane has the co-ordinates 8c and Sy, 
while the “optical” ray passes through 0. Then, if X is the optical 
path, we have 

8Z- = %(»! -f Srj) 4- n 2 {r 2 + Sr.) — for. 4- rur.J 
~ 4- (& — %)* -f 4 MjV% 2 4- (is. - 

-njV^+y? - (2) 

Neglecting terms of second order, this becomes simply 



SZ = -/ 4 - J5S£|_«\ Sy. 
VvV+fc* Vzf + tf/ 


-y ~ S + « na i + % 

vV + Vi v V 4 ]Jt 

However, by Snell’s Law, the right member is zero, which proves the 
theorem for refraction. Since any optical path in geometric?.! optics 
may be considered to be a combination of reflected and refracted rays, 
the theorem is thus demonstrated in the general case. 

If an image of a point P is formed at another point P' many rays 
unite at the image, ail of them having travelled according to the laws 
of geometrical optica. Each ray mast represent a stationary optical 
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path, as compared with a neighbouring path which does not obey 
these laws. This is only possible if the optics,! path is constant for all 
optical rays. From the standpoint of wave optics, the constancy of 
optical path is a basic necessity, since the waves which converge in 
the image must all arrive in the same phase. 

In order to understand the second important, general theorem, 
the Law of Malus, it must be recalled that we distinguish between two 
kinds of ray systems in geometry— those which represent the normals 
to a surface (orthotomic systems of rays) and those having no ortho- 
gonal surfaces (non-orthotomic systems). The rays emanating from a 
point are certainly orthotomic, for the spheres with centre at this 
point are the required orthogonal surfaces. The Law of Malus states 
that the property of orthotomy is not destroyed by reflection or re- 
fraction, although the originally spherical orthogonal surfaces may be 
altered in shape. This iaw is self-evident from the standpoint of wave 
optics, for there must always be wave fronts corresponding to the rays. 
We therefore r drain from giving a proof from the point of view of' 
geometrical optics. One further consequence of the Law of Malus is 
important. If we consider the parts of two rays included between two 
surfaces, their optical paths must be equal in length. This must also 
be true for rays that intersect later on, in which ease the second wave 
surface passes through the point of intersection, where it has a singular 
point. 


2. Use Properties of Colliuear Projection. 

The ideal case of optical image formation is that in which ail rays 
taming from a point unite again at another point, while the images 
of straight lines are straight files, and the images of planes are again 
planes. A priori, one cannot say whether or not this type of image 
formation is possible with optical means. We shall assume that it is 
po'jsible, and without going deeper into the 'question we investigate 
the properties of this type of correspondence between the points of 
the object space and those of the image space . Such a correspondence 
between the co-ordinates rj, of P and if, £' of P’ is represented 
mathematically by linear fractional transformations 


t> ^ < h£ + hv + c d + A 
-r Kv + c o£ + <*o 
/ __ a 2% + &a u + + 4 

a oC + V/ + c o£ A- d Q 

fi __ a 3$ H ~ h 7 l 4~ ~r d 3 
«o£ + b 0 n + c 0 £ + d 0 j 


• (3) 
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As may be verified immediately, the equation of a plane 
l g' + W117' + n£' — p = 0 


in the image space corresponds to a plane in the object space also. 
The intersection of two planes — i.e. a line — in the object space corre- 
sponds to a line in the image space, and the intersection of three planes 
— i.e. a point— again corresponds to a point. We shall restrict our dis- 
cussion to axially symmetric (centred) systems. 

If the £-axis is the axis of symmetry, we need investigate the image 
formation only in a meridional section, using the co-ordinates g and 
rj. The symmetry of the image formation requires only that the g' 
co-ordinate be unchanged when ij is replaced by — 17 but that the 
sign of r\ change when that of ij is reversed. Hence for a centred 
system b 0 , b x , a* and d t vanish, so that we now have the simpler 
expressions 


Solved for g and 17, these equations are 

* d 0 f - ^ __ Kd 0 - Vi) V 

+ V V + aj 


• (3") 


The infinitely distant plane of the image space corresponds to the 
plane a 0 g -(- d 0 — 0 ; that of the object space corresponds to the plane 
a 0 g' — <h — 0. These arc called the focal planes of the respective 
regions. Their two points of intersection with the axis are called the 
principal foci F and F' of the system. The transformation formulas 
become still simpler if we move the zero points of the g- and £'-axes 
to the respective foci. We shall also take the positive direction of the 
axes in object and image space to be that in which the light is propa- 
gated, Denote the co-ordinates of this system by x and y and x' and y' 
respectively. We then have the transformation 






Introducing the notation 


h 

«o 


a A ~ **1^0 


=//', 




(5) 
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we then obtain 


xx' =ff ] 

y •” r) 


(6) 


The lengths /and/' introduced here are called the focal lengths of object 
space and image space respectively. We shall see at once that they 
are identical with the focal lengths introduced in the elementary lens 
theory. 

The ratio y ! jy is termed the transverse magnification. For the points 
of the plane x — — /, which corresponds to the plane x' — —f, this ratio 
is unity. These two planes are called the principal planes or unit planes. 
Their points of intersection - with the axis are the principal points 
H and H'. Any optical system may be schematically represented by 



an axis, the foci and the principal points. The distance e from a point 
of the object to the principal plane in the object space is called the 
object distance ; similarly, e', the image distance , is that of the corre- 
sponding image point from its principal plane. Then (see fig. 2), 

—e —f— x. =/' + x’ 
and the first equation of (6) gives 

(«-/)(«'-/')=#' or {+£ = 1. ... (7) 


For a symmetric system, / = — -/' and we obtain the elementary lens 
formula * 


1 I 1 1 

~'e + 7~f 


(T) 


In elementary optics we measure the image and objeot distances 
for a thin lens from the middle of the lens. For a more complicated 
optical system, e.g. a photographio objective, this is no longer allow- 

* This formula is usually written for the absolute values, i.e. without the minus 
sign attached to 1/e. The same is true for equation (8), following. 
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able. In this instance the unit planes (from which these distances are 
measured) may be far apart and even outside the lenses themselves. 

For the construction of the image we make use of a ray parallel 
to the axis and another passing through the focus (fig. 2). The former 
meets the unit plane of the object space at A; this point corresponds 
to the point A' at the same height on the unit plane of the image 
space. The ray from P through A and A', parallel to the axis, must 
then pass through the principal focus F’ of the image space. The ray 


1 

1 

Y/~ 


Y’ , 

Y 

r 

p(co) '/y 

i 

!f' 


Fig. 3 


from P through F, on the other hand, meets the first unit plane at B. 
This ray must then cut the other unit plane at the point B', which is the 
same distance from the axis, and must then travel parallel to the 
axis. The intersection of the two rays in the image space locates P', 
the image of P. A ease of particular importance is that where the 
object point is at infinity and in a direction which makes an angle u 
with the axis. This point corresponds to a point in the focal plane of 
the image space; its co-ordinate is found as follows: the ray in the 
direction n which passes through F (fig. 3) meets the first unit plane 
at A. This corresponds to the point A' at the same distance from the 
axis on the other unit plane. The ray then runs parallel to the axis, 
cutting the focal plane at P'. Then we have the relationship 

y' «= — /tan «. (8) 

This equation makes it possible to convert angular distances in (e.g. 
astronomical measirremenfcs) into linear distances in the focal plane of 
the observing instrument. 

In analogy with our definition of transverse magnification (p, 399) 
we may define the magnification in depth as the ratio of the difference 
of the abscissae of the two image points to the difference of the 
abscissae of the object points. This ratio, however, depends upon 
the abscissa itself, and thus is not definite except for infinitesimal 
differences. It is therefore preferable to define the depth magnification 
as equal to dx'jdx. Then, by (6) (p. 399) 

& = 

dx a? x' 


• • (») 
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Since this quantity depends upon x, we see that collinear projection 
does not preserve similarity. 

If / and f have different signs, as assumed, then a movement of the object 
will cause the image to move in the same direction. This is the case for reflection. 

The angular magnification or convergence ratio is defined as the 
ratio of the tangents of the angles between the axis and a ray in image 
space and in object space, the ray being such as to cut the axis in each 



region. This ratio is constant for a given point on the axis, for it 
follows from fig. 4 that 

tamt' f+x_f+x_(f+x)x_x /im 

ton u~f + x'- £-f'(x+f)-r * [ } 

J ' x 


The product of the transverse magnification and angular magnification 
for a given plane is constant for the entire system, for we have from 
equation (10) and equation (6) (p. 399) : 


y‘ term' _ / 

y tenw /'" 


(ID 


Combination of two imaging -processes 
The result of two successive collinear transformations is again 
collinear, and so there must be a single projection equivalent to the 
two. We set ourselves the task of determining the four cardinal points 
for the result, i.e. the focal points and the principal points or their 
corresponding planes. Fig. 5 shows these planes for each of the two 
processes, denoted respectively by subscripts 1 and 2. The same letters 
will be used to indicate the intersection of each plane with the axis. 
A ray parallel to the axis and distant y from it is drawn in the object 
space. After meeting H 1 and Hf it goes through the focus Ff. Its 
inclination to the axis in the first image space is 

tomq' = — 

Ji 


(12) 
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consists of two thin lenses, for each of which the principal planes may 
be assumed approximately coincident, the distance d between lenses 
may be written d — D + ff — /*, or d — D +f l > +ff (see figure). 
Putting this value of D in (14) and in (14') we get, with the lenses in 


1 

f 


_1 , 1 <L 

Si fa Sift 


and 7~k + k + M 


(15) 


If, in addition, the lenses are close together, the result is simply 


1 

/ 


= fi + fS 


(16) 


Hence for thin lenses in contact, the reciprocals of the separate focal 
lengths simply add up to give the reciprocal of the combined focal 
length. For this reason the reciprocal of the focal length of a lens is 
called the power <f> and is measured in diopters, one diopter being the 
power of a lens whose focal length is one meter. It must be remembered 
that the additivity of the powers — which holds, incidentally, for any 
number of lenses — is valid only when the same medium is present on 
both sides of the system. 

Ex. 102. The solar disc has an angnlar diameter, as seen from the earth, of 
1/2°. How large is the image formed by a lens of fooal length 2 m.? 

Ex. 103. The plane of an object is inclined to the axis of a lens system. Locate 
the image plane. 


3. Thft Practical Problem of Image Formation. Abbe’s Sine Law. 

General Path of an Elementary Pencil from a Point Source. 

(a) Imaging of points near the axis by means of narrow pencils 
Suppose that the angle between the rays diverging from a point 
bn the axis is so small that the sine and tangent may be taken equal 
to the angle itself. Assume further that only points near the axis are 
considered, so that their distances from the axis may be taken equal 
to the lengths of circular arcs ' whose centres are on the axis. Under 
these conditions, calculation of the refraction at a centred spherical 
surface leads to the collinear formulae. Since the net effect of suc- 
cessive collinear transformations is again collinear and since, to the 
above approximation, any surface of rotation may be replaced by its 
sphere of curvature, it may be said in general that within the limits 
of accuracy described, any system consisting of coaxial refracting sur- 
faces of rotation is collinear. This statement is true even if some of the 
surfaces are reflectors, since this case is merely one where n = —1. 

The truth of the above statement will be shown for a refracting 
spherical surface separating a medium of index », on the left, from one 
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of index n' on the right. The convention regarding algebraic signs will 
be that all axial segments to the left are negative, those to the right 
are positive; the curvature of a surface is positive in the convex side 
is facing the light. In fig. 6, let P be an axial point whose distance 
from the vertex of surface R is —s; and let a ray leaving P at an 
angle e with the axis arrive, after refraction, at another axial point P' 
whose distance from ft is s'. We must show that s' is independent of 
the angle e, i.e. that all rays coming from P again unite at P'. Draw 



the radius to the point K and denote the angle RMK by y. Then, 
using the sine law of trigonometry and replacing the sines by the 
angles themselves, we have (see figure) 


s' -r_p 
' s ' V 


(17) 


Dividing one equation by the other and using the law of refraction in 
the form na — n'j 8, we have 


( s — r)s' __ n' 
(s' — r)s~~ n’ 


(18) 


"(HMbD- 


(19) 


The angles a and are not present in the result. The value of the 
expression 


-(H) 


remains constant, and it is said to be a surface invariant. Equation (19) 
may be written in the form 
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Comparing with equation (7), p. 399, and putting e — s and e — s' 
yields * 

(2D 

and hence r ~ — — • ....... (22) 

J n 

If (22) is substituted into (11) and if u is substituted for tan u, the 
result is ths important Hetmholtz-Lagrange equation 

nyu --- n'y'u'. ....... (23) 

Thus the expression n<y t u t is also an invariant. Equation (23) repre- 
sents the relation between angles and lateral distances in the object 
and image spaces even whed there are many refracting surfaces, 
whereby it should he noticed that a large image corresponds to a 
narro w pencil and vice versa. 

With the help of (22) we can now determine the combined effect of 
two successive imaging processes in the general case where the indices 
in object and image spaces are not the same. Combining equation (14), 
p. 402, with d — B — / 2 and neglecting d, we have 

. ? ~inJ* & k i+ $?’ * * m 

or, introducing the power sf>, 

n% 4> ~~ «. a $ s ^2 4*s ’ ..... (25) 

Thus for a lens of index rq having a medium of index n 0 to the left and 
one of index n s to the right, the last equation becomes 

n 2 tf>’ = n, 4h + »s 4>i' t 

where tj> t ' and <}> 2 ' are, respectively, the powers of the front and rear 
surfaces. 

Up to now it has been shown, that refraction by a spherical surface 
can be described by a relation of the form of equation (7) by identi- 
fying —»//(»' -- ») with-/ and n'rj{n' — n) with/', and by measuring 
object and image distances from the plane norma! to the axis at the 
vertex of the sphere. In this case the focal planes are really distant / 
and/', respectively, from this plane, which thus represents the coinci- 
dent pair of principal planes. Thus, putting s ~ —or, in , (20) and 
combining with (21), we get /=•/'; in a similar way, for a’ ao we 
get a — /. If x and x' are now measured lxom the focal planes 
(a = x -}- / and ~~s — —x — /), then the first colli near equation of (6) 

* The factor# 1 In and 1/a' imply only a difference of ecalo in the two epassoa. 
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results. It remains to show that the second one is also satisfied. In 
order to find the object-image relation for a point Q near the axis 
(fig. 7), imagine the line PP together with the other rays to be rotated 
through a small angle about the centre M of the sphere. Then P goes 
to Q and P' goes to Q'. The arcs PQ and P'Q' may, to the approxi- 



mation adopted, take the place of the axial distances y and y*. Then, 
from the figure and from equation (19), 



If x and a/ are measured from the focal planes, so that we take 
— s = — * — / and s' = x’ +/', then the use of xx' = ff' gives us 


y' ^ x lL±ffL ^ X M±A 
y x f +ff /'(/+*) 




(27) 


so that also the second collinear equation is fulfilled. 

(6) Imaging of points near the axis by means of wide pencils. 

As soon as wider pencils of rays are used, the sine and the angle 
may no longer be taken equal and it is found that the image distance s' 
depends on the angle «. Rays emanating from P no longer re-unite 
in axial point P' but enclose a curve (or, in space, a surface of rotation) 
called a oaustic. It is possible, however, to find mathematically a 
refracting surface such that all rayB coming from P will converge again 
at P\ To obtain the analytical equation it is only necessary to apply 
the condition of constancy of optical path. For the case of reflection 
this surface is obviously an ellipsoid of rotation having the given 
points as foci, and it passes over into a paraboloid of rotation as P is 
moved to infinity. 

The imaging of an axis point is of little practical importance, for 
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even the neighbouring parts of an object lying in a plane normal to 
the axis are not sharply focused. If the problem is to form an image 
not only of an axial point but of an entire element of area lying normal 
to the axis — something that can be done by using several refracting 
surfaces — then a condition first discovered by Abbe most be satisfied. 
This condition will now be deduced, using Fermat’s principle (p. 395). 
Let the optical system, say a thick lens, be represented by the ourved 
Knes in fig. 8. The point P has its image at F; and P v which is in a 
plane through P normal to the axis, has its image at which lies 
in a plane through P' and normal to the axis. Rays drawn through P 
and Pi parallel to the axis Cross at I", while parallel rays drawn through 
this pair of points at an angle u to the axis will meet at some point i\'. 
It will not be concluded that F' and P/ lie in a plane normal to the 
axis; for this amounts to assuming that an infinitely distant plane 
is imaged, point for point, in a focal plane. This is more than we wish 



Fig. 8 


to prove, since our deduction is limited to an dement of area. Let the 
parallels to the axis enter the optical system at A and A y and leave 
it at A’ and A^. Similarly, let the oblique parallels enter at S and S 1 
and leave at S' and S t '. According to the Fermat principle, the follow- 
ing relations will hold for the optical paths, which are denoted by 
parentheses: 

(PAF’F) = (PSF 1 'F) and {PjAJT P,') - (W/ P/). (a) 

The paths (P v d 1 A 1 '.F') and ( PAA'F ') are equal, since F' is the inter- 
section of two rays that are normal to an orthogonal surface (cf. p. 402). 
Further, (F'P\) and (F'F) axe equal as far as small quantities of 
second order, because of the smallness of (P'P t '). Neglecting these 
term, we have 

(PAF'F) = (P 1 A 1 F'P 1 ’), (b) 

Combining with (a), (PSFj'P') — P^Fj'Pj'). ..... (c) 

Drop a perpendicular from P on to Pp and call the foot of it N. 
Then, because both rays are normal to the same orthogonal surface, 

{PSF l ’) = (NS l F l ') {d) 
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Subtraction of (d) from (c) yields 

(FYP') - - (NPJ + (Fr'P,'). , . . . («) 

Now erect a perpendicular P'N to the line F l , P f . As far as second 
order quantities, 

{F 1 'P 1 ')-(F 1 P')=(N'P 1 ') > ....(/) 

that is, (NPJ - N'Pf). ...... (g) 

If the index of refraction is m in the object spaoe and ml in the image 
space (the two indices would be different, for instance, for an immer- 
sion microscope objective), then 

{NPj) = ny sinw, (Ef'Pf) — n'y sinw'; . . (h) 
whence, according to (g), 

my sinw. — n'y' sinw'. ..... (28) 


In place of the Helmholtz-Lagrange equation we have the relation (28), 
while according to (11) and (22) colhnear projection amounts to the 
same relation with the tangents in place of the sines of the angles. 
This shows that collinear projection of the entire space, using arbi- 
trarily wide pencils of rays, is fundamentally impossible. Similarly, 
we could proceed to show, as in the deduction of the sine law, that if 
the sine condition holds for a given pair of axial points it is impossible 
for it to be valid for any other pair. The special case of plane reflec- 
tion is an exception. Here the whole of space may be imaged coliinearly 
using wide pencils, for since 5* =» y = y' and w = — the sine and 
tangent conditions are compatible in this instance. 

(c) Imaging of points distant fronn the axis by means of narrow pencils 

Even an elementary pencil from a point off the axis will fail to focus 
at a point after refraction at a spherical surface. Moreover, it is readily 
seen that rays in the meridian plane — the plane containing the axis 
and the point — have a different point of intersection from those in 
the plane normal to this one. Nevertheless, according to the theorem 
of Malus, every elementary pencil has an orthogonal surface. 

Imagine the pencil so limited that, the orthogonal surface in the 
image space is bounded by two pairs of neighbouring lines of curva- 
ture (fig. 9). As we know, neighbouring normals to the surface inter- 
sect along tbe lines of curvature. The normals at A and B intersect 
at E, those at C and D have their point of intersection at F. The 
straight line EF is normal to the principal ray drawn through the 
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centre of the -element of surface. In the same way, the second pair of 
lines of curvature furnishes a line of intersection GE which is in a 
direction at right angles to EF. The distances AE and AG represent 
principal radii of curvature of the surface element. The rays reunite 
at a single point only when these two radii are equal, otherwise there 
are two focal lines whose distance apart is called the astigmatic dif- 



Fig.cj 


ference A. An astigmatism of this "kind occurs when the principal rays 
meet a spherical refracting surface obliquely. 

Ex. 104. Show that the laws of reflection and refraction hold also for the angles 
whioh the incident and reflected (or refracted) rays make with any plane contain- 
ing the normal to the rurfs m of separation , 

Ex. 105. In a prism spectrograph, why are the lines curved ? 

Ex. 106. Compute the focal lengths and locate the principal planes of a thick 
lens of index n. 

Ex. 107. Find the form taken by the sine law for an snfhiibt-jy distant axial 
point object. (Introduce the height A at which a ray coming from an axial point 
cuts the first surface of the system.) 

4. The Resolving Power of Optical Systems. 

Even if the science of geometrical optica would enable us to con- 
struct an objective which, by compensation of the various imper- 
fections of single-point image formation, brought all rays from a point 
source (e.g. a distant, star) to a point focus, observation would still 
show the image to be a small disc surrounded by a system of rings. 
This pattern is due to diffraction by the edges of the lenses and dia- 
phragms. Although the point of observation is in the finite region — 
in our example the focal plane of the telescope objective— -the case is 
one of Fraunhofer diffraction, since diffraction at the aperture gives 
rays in various directions; this is equivalent to infinitely distant point 
sources of various intensities. 'The evaluation of the diffraction integral 
may ba performed for a circular opening with the aid of Bessel fimc 
tiona. Qualitatively, the intensity variation is not different from that 
found for a slit. However, for normal incidence, the first minimum 
is in a direction 0 given by 

r aind = 0-61 A, (29) 
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where r te the radios of the opening. A second point can be separated 
from the first if the angular distance between the two is at feast so 
great that the maximum for the second point falls npon the first mini- 
mum in the image of the other. Thus, for a oircukr aperture. 




0*61 A 


(30) 


The " limit of resolution ” thus defined cannot be influenced by in- 
creasing the magnification; its value, however, is inversely propor- 
tional to r, and this is the reason for the large dimensions of tele- 
scope objectives. 

Ex. 108. Calculate the resolving power of a prism in terms of its dimensions 
and the dispersive power dn/dX of the substance of which it is made. 


5. The Fundamentals of Interference Optics. Interference Fringes. 

In the construction of a certain class of optical instruments, the 
so-called “ interference ” instruments, the wave nature of light 
is taken into consideration in sc far as several coherent pencils 
are combined, this being done in such way as to allow for their dif- 



ference in phase. The diffraction phenomena occurring at the 
boundaries are not of essential importance in this type of device. 
If a plane wave be reflected from a wedge of small angle (fig. 10)), the 
superposition of the waves reflected from the front and back sides of 
the wedge will yield an increased or decreased resulting disturbance, 
depending npon the phase difference of ths emerging waves. Eut the 
phase difference depends only upon the optical path between the two 
faces of the wedge; thus there will be plaoes where the disturbance is 
intensified and others where it is weakened. If the incident beam is 
exactly parallel we shall not be able to see these places at all. As a 
matter of feofc, it is physically impossible to produce such a beam, 
since it would require a source consisting of a single point of light 
(whose surface brightness would therefore have to be infinite) placed 
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at the focus of a lens. In reality a source of light always has a finite 
extension, and since every point of the focal plane corresponds to a 
given parallel bundle of rays, the resultant beam always has a certain 
divergence. We shall assume, however, that this divergence is so 
small and that the thickness of the wedge is also so small that the path 
difference at a given point on the wedge is the same for all rays. The 
rays incident at this point then give a resulting illumination whose 
intensity depends upon whether the path difference at the front sur- 
face is an integrah number of whole wave-lengths or half wave-lengths. 
In the former case, the resulting illumination is intense; in the latter 
case, weak. The eye or a photographic objective focussed on the 
upper surface of the wedge will observe a system of light and dark 
bands, or “ interference fringes These are the “ curves of equal 



thickness ” of the wedge. As the angle of the wedge is increased, or as 
its thickness is made greater, even the rays at a given point have 
different path differences and the visibility of the fringes decreases 
considerably even for monochromatic light. 

On the other hand, if we allow a divergent beam to fall upon a 
plane parallel plate, the difference in inclination gives rise to a dif- 
ference in path of the rays reflected from the two surfaces. If the index 
of refraotion of the plate or film is the same as that of the surrounding 
medium, as in the case of the Fabry-P6rot interferometer, the path 
difference for an angle of incidence a is (cf. fig. 11) 

AO + CD — AB — -A 2 d tana sina = 2 d cosa. (31) 
cosa 

In those directions for which 2d cob a = wA there is light, while for 
directions given by 2d cosa = (2m -f- 1) A/2 there is darkness. All rays 
making an angle a with the normal to the surface yield a circle in the 
focal plane of an objective focussed for infinity, on account of the com- 
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plete symmetry about the normal, so that the fringe system consists 
of concentric circles. The rings become narrower if the thickness of 
the plate is increased, but — unlike the previous case — their visibility 
is limited only to the degree that the light source is not strictly mono- 
chromatic. If we assume that -a spectral line consists of, say, two 
strictly monochromatic components, there will be a condition where 
the interference fringes of one component fall between those of the 
second, making the resultant pattern indistinct. In reality, no 
spectral line is strictly monochromatic ; the fact that a radiating 
atom sends forth a succession of finite wave trains means that the 
spectral line actually consists of a continuous distribution having a 
sharp maximum. This may be seen by considering the finite wave 
train analysed into an 'equivalent sum of Fourier components. The 
result is that the interference effects due to the different wave-length 
regions overlap when the path difference is large. 

If we have but two interfering rays, the vector diagram indicates 
& decline of intensity according to the formula 

I — /nK.COS 8 ^, O .... . (82) 

where is the phase difference. If we allow a larger number of rays 
to interfere we obtain a considerably steeper decline of intensity, 
and hence greater resolving power for wave -lengths which are close 
together (of. p. 52). This condition may be realised by lightly 
silvering the -two surfaces of a plane parallel plate, so that these 
surfaces still transmit part of the light falling on them. Light 
• transmitted by the system consists of a large number of partial rays 
(in reality infinitely many, with decreasing intensity) which have 
experienced various numbers of reflections. This arrangement is used 
as a spectroscope of very high resolving power, and i8 the familiar 
interferometer of Fabry and Pdrot. If we had a finite number of partial 
beams of equal intensity we could apply the results obtained for the 
grating by means of the vector diagram. We found the resolving power 
of a grating to be (p. 384) 

R a= Nm. . (33) 

The difference here is that the number of rays of a given intensity 
is small, while the order — i.e. the path difference for neighbouring 
rays — is large in the present case. If the thickness of the piste is 
0-25 cm. and we have green light, A =- 5 X 10~ 5 cm. incident normally, 
m — - 2djX ~~ I0 4 , wlfile m is usually less than 5 for the grating. The 
fact that we really have infinitely many rays whose intensity de- 
creases geometrically modifies the considerations somewhat. There 
is no longer a sidewise minimum, and we must- impose a criterion 
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which determines the distance of a second maximum which just may 
be perceived separately. In practice, an interferometer of this type 
is characterized by the equivalent number of rays of equal intensity 
— a number around 20. Thus in our case this implies a resolving power 
of about 2 X 10 5 . To obtain an idea of what this means, let us con- 
sider the two D-lines of sodium, A — 5890, 5896. The fractional wave- 
length difference is about 1G -3 , so that the above interferometer would 
be capable of resolving two lines whose separation is about one two- 
hundredth that of the sodium lines. 

One disadvantage of this type of interference spectroscope, how- 
ever, is its small region of dispersion, which makes it necessary to 
use this instrument by crossing it with another type of dispersive 
instrument, e.g. a grating. If the incident light embraces a consider- 
able wave-length region, the resulting interference pattern will be 
hopelessly complicated. The “region of dispersion” AA is that dif-- 
ference in wave-length for which the m-order interference pattern 
of one wave-length coincides with the (m + 1) order pattern of the 
other. Then 

m( A -f AA) = (m -|- 1)A 

or AA = — . ...... (34) 

m 

Thus the region of dispersion in the example above is one-tenth the 
separation of the sodium lines. 

Ex. 109. Calculate the difference in path of rays reflected from the two sur- 
faces of a plane parallel plate of glass if the rays are parallel and, enter the plate 
at an angle of incidence a. The outside medium is air. 



PART IV 


The Theory of Electricity 
II. The Atomistic Nature of Electrical Phenomena 

In the preceding part of this work we considered a theory which found its 
complete formulation, for stationary bodies, in the Maxwell Field Equations. 
This theory made use of a number of experimentally determinable material 
constants — the conductivity a, the dieleotrio constant K and the magnetic per- 
meability [a, as well as the index of refraction n whioh is derivable from these 
quantities. But, as mentioned in several instances, these quantities are not in- 
variable material constants, but depend upon temperature, pressure, and — for 
rapidly varying fields— upon the frequency. By its very nature, the field theory 
. is powerless to give any information concerning this variability of a, K and p; 
rather, we must try to construct a picture, based upon suitable assumptions as 
to the struoture of matter and of eleotrioal oharge, whioh furnishes the correct 
functional dependence of these quantities upon the several variables. Thus in 
contrast with the field theory, which uses only macroscopio quantities, we must 
employ oonoepts of a microsoopio nature which, in part, are not oapable of direct 
verification and whose validity can be tested only by examining the conclusions 
obtained from their use. 



CHAPTER XXII 
Electrolytic Conductios 

1/ The Fundamental Phenomena of Electrolytic Conduction and their 
Interpretation. 

We begin with the consideration of a phenomenon the experimental 
laws of which are not explicable by the Maxwell equations, but have 
been known for a long time. These empirical relationships led, at an 
early period, to hypotheses concerning the nature of electricity. The 
phenomenon referred to is the conduction of electricity in solutions 
of acids, bases, or salts — the so-called electrolytes. 

Electrolytic conduction follows Ohm’s Law at all field strengths from 
the weakest to the strongest, i.e. at constant temperature the conductivity 
is independent of the field strength. Only recently have small departures 
from Ohm's Law been found for the highest fields. This type of- con- 
duction is accompanied by a transport of matter. The constituents 
of the substance, in solution are deposited at tbe electrodes in accor- 
dance with the familiar Earaday Law, which states that the electro- 
lytic liberation of one chemical equivalent * of a substance always requires 
the same quantity of electricity, viz. : 

F = 2-893 X 10 u e.s.u. gm. sec./mol = 9649 X 10 4 amp. sec./kg. mol. 

A chemical equivalent is understood to mean that mass of an element 
or group of elements (chemical radical) which could displace one gramme 
molecule (1 mol) of atomic hydrogen, i.e. 107-9 gm. of univalent silver, 
(l/2)(63-6) gm. of bivalent copper, (l/3)(138-9) gm. of trivalent lantha- 
num, &c. 

If we look upon the atomic nature of matter as a proven fact, 
as we now have every reason to do, the Faraday Law makes the 
atomic character of electricity seem extremely probable. One equi*- 
valent of silver contains L = 6-02 X 10 23 silver atoms. The transfer 
of this number of atoms is associated with the transfer of a quantity 
of electricity F, so that on the average each atom conveys a charge e 
given by 

6 ~ x ~ ~ 1 60 x 10-19 ° oulom * >s = 4* 80 x 10-10 ®- 8 - u - 

Since we always find this value in every experiment, it is natural to 
* In the M.K.S. system the kilogramme mol must Be used. 

15 «« 
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assume that this quantity is not only the average value — which is. all 
we are justified in concluding thus far — but that every ttnivalent atom 
actually carries a charge e, every bivalent atom a charge JP/(i/2) = 2e, 
&c. The charge e may be called the elementary unit of electrical 
charge — the “ atom ” of electricity. For reasons which we shall dis- 
cuss later, it is most commonly called the charge upon the electron. 
The existence of this element of charge has been demonstrated 
beyond question by a large number of experiments. On account' 
of their behaviour in electrolytes, we call atoms or atom groups 
which carry one or more elementary charges ions (Greek lov, going). 
What is the origin of the charged particles? We now know from 
X-ray crystallography that a simple crystal of a salt, e.g. an NaCl 
crystal, is built up not of neutral- atoms, but of positive Na + ions 
and negative Cl“ ions. ' When dissolved in water the lattice struc- 
ture is destroyed, but it is very probable that the ions retain their 
charges. It is, of course, possible that a certain part unite to form 
neutral NaCl molecules. 

In tire case of electrical discharge in gases, the carriers of the cur- 
rent are formed by the field itself, as we shall see later. This cannot 
be true in the present case, sinoe we find the same value of the con- 
ductivity for the weakest fields as we do for very strong ones. 

If, in one cubic centimetre of an electrolyte, we have N+ positive 
ions of valence z+, and N_ negative ions of a substance of valence z_, 
and if the speeds of the two kinds of ions are v+ and o_ respectively, 
then the charge flowing through unit cross-section per second is 

* == (z + N+v+ + ....... (1) 

for as many positive ions pass through the unit cross-section each 
second as are contained in a prism of length t?+, i.e. N+v+. Similarly, 
Njv_ ions pass in the opposite direction; this corresponds to a stream 
of positive charges in the direction of the field. Thus * is the value of 
the current density. Since the solution as a whole is neutral. 


N+Z+ = (2) 

so that 

i = N^z+e(v+ 4- «_). ...... (!') 


If Ohm’s Law is to hold, i.e. if i is to be proportional to the field 
strength E, then according to equation (1), v+ as well as v_ must be 
proportional to E and independent of the time. One might at first 
imagine that a constant force ze E, which acts on an ion of valence 
s, would produce an acceleration, i.e. a velocity varying with time. 
But in addition to the field, the ion is subject to the frictional resis- 
tance of its motion through the solvent. If we make the somewhat 
bold assumption that Stokes’s Law (p. 222) holds even for spheres 
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of atoifiio size, we have a resistance / proportional to the speed: 

/= 6mjRv, (3) 

where R is the radius of the sphere and 17 is the viscosity of the 
fluid. After a short period of acceleration theri is equilibrium between 
the force exerted by the field and the resisting force, so that 


_ zeE " 
V 6 tT7]R‘ 


( 4 ) 


The velocity attained in unit field is called the mobility of the ion: 


v ze 

U ~E~ 6rr^H‘ 


0 ) 


The mobility is usually referred to a field of 1 volt/om. Introducing 
the mobility into equation (1) (p. 418), we have 

* = (z+N +u+ + zJN_u_)eE = N+z+eE («+ + «_), . (6) 

and the conductivity becomes 

a = (z+N +«+ + zJV_u_) e == N+z+e (u+ + «_)• • (7) 

Thus the value of the conductivity alone does not give the mobility 
of each kind of ion, but only the sum of the two. In addition, we 
must know another relation between the two mobilities, e.g. that they 
are equal, or that their ratio has a given value. This may be deter- 
mined experimentally (cf. below). We see also from equation (6) that 
th^ mobility is greater the smaller the viscosity of the solvent. Now 
the latter decreases, for all liquids, with rising temperature, so that 
we have an explanation of the increase of electrolytic conductivity 
with increasing temperature. 

The ratio of the mobilities may be determined by Treasuring the 
changes in concentration occurring at the electrodes when a current 
passes. The natural diffusion tending to cancel these differences in 
concentration is prevented by inserting a porous diaphragm into the 
vessel. We shall compute the mobility ratio in this way for an electro- 
lyte whose ions are both univalent. There are deposited at the cathode 
not only the cations which have migrated there, but also those set 
free by the moving away of their oppositely-charged partners. That 
is, only one kind of ion need have appreciable mobility in order that 
there may be deposits at both electrodes. If it were otherwise, the 
electrolyte would have an excess charge at the electrodes; this charge 
would equalize itself by deposition of the superfluous ions. Fig. 1 is 
a schematic diagram of an electrolytio cell in which we assume the 
ratio of mobility of the cation to that of the anion to be S : 2. 'Only 
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those cations which are set free by the departure of the anions have 
any effect on the change in concentration at the cathode; Hence the 



deposited. deposited 


change in concentration at the cathode, expressed in gr amme molecules 
of the solute is ,, -n 

, Nju_E /a , 

An r — — y~~ > ....... (o i 

where E is the field strength and L is Avogadro's Number — the 
number of molecules per mol. The change in concentration at the 
anode is ,, „ 

fi) 

Since N_ — N+ when both ions are univalent, we have 


A n K _ u_ 
&n A u+ 


( 10 ) 


The losses in concentration at the respective electrodes , expressed in gramme 
molecules of the solute, are inversely proportional to the mobilities of the. 
ions arriving there. This relation was given by Hittorf. 

In this form, the law holds also for an. electrolyte whose ions have 
valences z+ and z_ The chemical formula of the solute is then K t _A, +l 
where K represents the cation, A represents the anion, and z + and z_ 
are relatively prime. It is true that we now have zJN_ — z+N + in 
place of simply 2V_ = N+, but the decrease in concentration at the 
anode, expressed in gramme molecules of the solute, is calculated from 
the decrease in the number of cations by dividing by z^L (cf. 
the chemical formula), so that the valences cancel in forming the ratio. 

The ratio of mobilities gives also the ratio of the parts of the total 
charge carried by each kind of ion. Denoting the fraction of the total 
charge carried by the cations by q+, that carried by the anions by 
g._ — 1 — q+, we have by equation (6) (p. 419), 


q+i = z+N+u+eE, 
q_i — zJd_u_eE, 


whence (H) 


Following Hittorf, q + and q_ are called the transport numbers. 
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According to equation (6) (p. 419) and equation (11) immediately 
above, the individual mobilities of the ions may be determined from 
measurements of conductivity and transport ratios. These mobilities are 
very small. The following are several values in cm. seo. _1 /volt cm. -1 : 
Li+ 36*8 X 10-* 

Na+ 46-0 X 10 -8 
K+ , 67-6 X 10 -8 
Mg++- 47*6 x 10 -8 . 

If we use these values in conjunction with Stokes's Law to determine 
the ionic radii, we obtain values of the correct order of magnitude, 
10~* cm. But we see that these radii are not the same as those of 
the free ions in the gaseous state, for — contrary to well-founded 
results of atomic physics — they decrease in the order Li-Na-K, rather 
than increase. The reason is that there exists a stronger electrostatic 
held at the “ surface ” of a Small ion than at the surface of a large one, 
assuming the charge to be symmetrically distributed over a sphere. 
This field causes the attachment of molecules of the solvent, the effect 
becoming more feeble toward the outside. The observed radius is, 
then, the effective radius of the ion together with its shell of molecules 
of the solvent. For this reason the mobility of the bivalent Mg+ + 
ion is not very much different from that of the similar univalent Na + 
ion, although for a given field, the force on the former is twice as great. 
If we omit from consideration the constituent ions of water, H + and 
OH - ,* we may say that the mobilities of all ions are approximately 
equal, for if an ion has a greater charge, and hence a greater driving 
force, it will also have a greater effective radius, and hence experience 
greater resistance to its motion. 

2. Dependence of Electrolytic Conductivity upon Concentration. The 
Theory of Debye-Htickel and Onsager. 

Let there be given an electrolyte in which p gramme equivalents are 
dissolved in each cubic centimetre of solution, and let the fraction 
ionized be a, while the fraction 1 — a remains in the form of undis- 
sociated molecules, and so has no effect on the transport of electricity, 
a is called the degree of dissociation. The conductivity is then given 
by equation (7) (p. 419) as 

a — apLe(u+ -f «_) = apF (u + + «_). (12) 

The ratio ajp, the equivalent conductivity A, would be independent of 
the concentration if 

1. a were independent of the concentration, and if 

2. the mobilities u + and w_ were also independent of this quantity. 

* These ions hare abnormally large mobilities: 329 x 10 - * for H+ and 180 X 10 - * 
for OH-. There are undoubtedly other factors whioh play a rflle here. 
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The observed very pronounced dependence ol A upon the concen- 
tration can be explained only by discarding one or both of the 
above assumptions. The first alternative — rejection of 1 and reten- 
tion of 2 — corresponds to the classical theory of electrolytes con- 
nected principally with the name of Arrhenius.* At first sight the 
fundamental assumption of this theory seems very plausible: the 
frictional forces are assumed to result wholly from interaction with 
the molecules of the solvent. This is quite probable; for example, in 
a molar solution of NaCl the ratio of the number of ions per c.c. to 
the number of water- molecules is about 18 : 1000. This hypothesis, 
however, overlooks the enormous electrostatic forces exerted by the 
elementary charge within a range of a few atomic radii. 

The presence together of molecules and their products of dis- 
sociation represents a type of chemical equilibrium governed by a 
thermodynamic law known as the Law of Mass Action (of. p. 542 
et seq.). For the case at hand this rule takes the form 

rl-j-Km, 

where the mass action constant K depends only upon the temperature 
T., From th i s we see that a approaches the value unity as the con- 
centration is decreased. For small values of the concentration we 
may set a — 1 in the numerator, in which case we see that the 
degree of association 1 — a is proportional to the equivalent concen- 
tration. If the classical formulation is correct, the ratio of the equiva- 
lent conductivity A for finite concentration to that extrapolated to 
aero concentration, A t „ must be equal to the degree of dissociation 
a, which in turn must follow the law of mass action. This is found 
to.be true for the greater number of. weak electrolytes, i.e. substances 
— chiefly organic acids and salts — having low values of A at moderate 
concentrations. On the other haftd, for strong electrolytes— the 
commoner inorganic acids, bases and salts — we find great deviations. 
In some instances the expression 

/AY 

” A 0 

varies by several powers of ten. For this?' reason theories based on the 
alternative point of view ware proposed from the very beginning. These 
speculations were based upon the assumption that dissociation is 
complete even for relatively concentrated solutions (up to perhaps 
1/100 molar), but that the ions exert mutually hindering forces upon 
* 6. Arrliemus, Zaitec&r. p/ty*. Chew.., 1, p. 631 (1887). Lehrbuch der Zieterachemie, 
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each other. It was Debye and Hlickel who first succeeded in over- 
coming the extreme mathematical difficulties attendant upon the 
computation of these inter-ionic forces. Thoir theory was considerably 
improved later by L. Onsager. We are able to give only the main 
features of the calculation here; the original papers should be con- 
sulted for details.* 

If we consider a given positive ion and note the number of ions 
of both kinds in its neighbourhood, there will be, when averagod over 
a long time or over many positive ions, a preponderance of negative 
ions in its immediate vicinity. If we compute the average distribution 
of charge in this region, we find it to consist of a spherically sym- 
metric negative charge whose density falls off rapidly with distance 
from the centre. Similarly, a negative ion is surrounded by a cloud 
of positive charges. The manner in which these clouds of charge come 
about may be Been by considering a crystal lattice (fig. 1, p. 573). 
Here also, the nearest neighbours of each ion are ions of opposite sign. 
We may picture the structure of an electrolyte in a similar way except 
that the lattice structure is replaced by one which is more u plastic ”, 
The computation .is made possible by imagining the charge in the 
neighbourhood of a single ion to be replaced by the average continuous 
volume charge obtained by averaging over the actual ion configurations 
from instant to instant. The density of this average ion cloud is indi- 
cated in fig. 2. Plate I, by different degrees of shading. 

Besides the Stokes resistance, there is a retarding force of the 
following nature: it requires a certain time for the cloud of ions to 
attain its spherically symmetric equilibrium configuration. If the 
central ion is moved forward a distance ds in the time di, the surround- 
ing charges tend fcc retain their former configuration. Tins means that 
there is a deficiency of charge of opposite sign on the forward aids and 
an excess on the rearward side. The electrostatic forces then retard 
the ion. 

In addition to this “ force of relaxation ", there is a further re- 
tarding effect, the electrophoretic force. According to p. 218, when a 
sphere moves through a viscous fluid, the velocity of the fluid at the 
surface of the sphere is the same as that of the sphere, and decreases 
outward. If, then, negative ions travelling in the opposite direction 
predominate in the neighbourhood of the positive ion, the resultant 
effect of the jurxounding ions is to oppose the motion of the positive 
ion; otherwise steted, this ion moves through a region of thte solvent 
which itself ia flowing in the opposite direction. The results obtained 
by quantitative investigation of these ideas are: 

1. The mobility of a given ion depends not only upon the nature 
of the ion and the solvent, but also upon the nature of the other ion. 

*P. Dobra and O. Httokoi, Pky*., 84, p. 306 ( 1923 ;. h, Gangor, iirid K 87, 

p. 383 mmy 
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The independence of ionic motion is valid only for infinite dilation. 
The numbers given in § 1 (p. 421) are limiting values of the mobilities 
at zero concentration. 

2. The two retarding forces are, like the Stokes resistance, propor- 
tional to the velocities of the ions, to a first approximation. Thus 
Ohm’s Law remains valid, although only to a first approximation for 
low field strengths. As a matter of fact, M. Wien found consider- 
able departures from Ohm’s Law for very strong fields (1$ 8 volts/cm.). 
He observed an inorease in conductivity with increasing field, in good 
agreement with the theory.* In the limit of infinitely strong fields the 
added retarding forces vanish entirely, so that the remaining depar- 
ture from the limiting value of the conductivity for infinite dilution 
is correctly ascribed to association. 

3. For very dilute solutions the departures from the limiting value 
of the conductivity, i.e. the quantities 1 — A/A 0 are no longer pro- 
portional to the concentration itself, as in the law of mass action, 
but to. the square root of the concentration. This rule was advanced 
as an empirical fact by Koblrausch long before the theory was pro- 
posed. 

4 . For a number of equally concentrated solutions of the same 
solute in different solvents, the departures are greater for the solvents 
of smaller dielectric constant. To see that the inter-ionic forces must 
increase with decreasing dielectric constant we need only recall that 
the expression for the force between two point charges has K in the 
denominator. 

The development of the cloud of ions about a given ion is imper- 
fect for rapidly alternating fields as well as for very strong fields. As 
a result, the added retarding forces are absent when high-frequency 
alternating currents, corresponding to a wave-length of 10 m. or less, 
are used. Going to still shorter waves, the electrolytic conductivity 
stops altogether when the wave-length reaches a few centimetres, as 
has been known for a long time both experimentally and theoretically. 
The reason is that the ions, on account of their inertia, are no longer 
able to follow the alternations of the field. This provides us with the 
connexion with optics. In the optical range, solutions of simple salts 
like vNaCi are to be considered insulators, as is shown by their 
transparency. 

Ex. 110. In order to gain an idea of the magnitude of the electrostatic forces 
calculate (a) the attraction between two elementary charges e — 4-80 X 10~ 10 
6.S.U. which are at a distance of 10 -8 om. apart; (5) the attraction between the 
positive and negative ions in 1 mol of rook salt which are separated and placed 
at a distance apart equal to the earth’s diameter (12,700 km.); (c) the surfaoe 
pressure necessary to hold together a sphere of 10 cm. radius into which all the 
ions of one sign are packed. 

* M. Wien, Ann. d. Phys., 83, p. 327 (1927). 
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The Conduction op Electbicity m Gases 


1. Direct Determination of the Elementary Electrical Charge by the 
Millikan Oil Drop Method. 

Before we treat the theory of electrical discharge in gases we wish 
to decide whether the value of the elementary charge on a univalent 
ion found in electrolysis is merely an average value (this is all we 
can infer so far) or the true value of a smallest unit of charge. The 
experimental proof of the correctness of the latter assumption was 
given by Millikan. The arrangement used by him was essentially 
that described on p. 263. The procedure is to observe the motion, in 
an electric field, of small test bodies carrying a few elementary charges. 
Millikan used droplets of oil formed by atomization. Their motion 
was observed in a microscope, the droplets being illuminated from the 
side. The droplets are in a vertical electric field. With the field off, 
the terminal velocity of fall u 0 is determined by the equilibrium be- 
tween gravity and the Stokes resistance to motion through the air. 
If we assume — as we may for oil drops — that the droplet is spherical 
and of radius a, and if we call the density of the oil a, that of the air 
<j a , and 17 the coefficient of viscosity of the air, then we have the equa- 
tion 

6in,<M> 0 =ya»(or-aj0. (1) 

From a measurement of u 0 we obtain the radius of the droplet, and 
thus its mass. If We now apply an electric field E which, with gravity, 
urges the drop downward, and if we again determine the velocity v v 
we have 

6n V av l =e i E+~aHa-^)g. ... ( 2 ) 
The charge on the drop is then given by 
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The result of Millikan’s experiment is that the charges e t are small * 
integral multiples of a unit whose magnitude is 

e = 4-80 X 10~ 10 e.s.u. = 1*60 X 10 -19 coulombs. 

For the exact calculation it is necessary to introduce a kinetic theory 
correction to Stokes’s Law, since the size of the drop is no longer large 
compared with the mean free path of the air molecules. Several in- 
vestigators have expressed doubt as to the justification of some of the 
assumptions embodied in the experiment, viz. the accurately spherical 
form of the droplets, the identity of their density with that of the oil 
in bulk, and the decrease in mass due to evaporation during th9 period 
of observation. It is fairly certain, however, that any errors attributable 
to these causes are very small. 

The Millikan experiment not only proves the existence of the 
quantum of electricity e, but also gives the most accurate value of this 
constant so far obtained. This quantity also furnishes a value of L, 
since the Faraday equivalent charge F — Le is directly measurable 
with great accuracy. There results 

L =• 6*02 X 10 28 mol- 1 = 6*02 X.10 28 (kg. mol)" 1 . 

2. The Nature of the Cathode Ray Particles. The Electron. 

If a glass tube provided with two electrodes is pumped down to a 
pressure of a few thousandths of a millimetre of mercury, and if an 
electrical potential of several thousand volts is applied, one observes 
a green fluorescence on the wall of the tube opposite the cathode, what- 
ever may be the position of the anode. The fluorescence is even stronger 
if observed on a screen of zincbl$nde placed within the tube opposite 
the cathode. The fluorescence is said to be caused by “ cathode rays ”, 
The fact that the fluorescence spot may be deviated by allowing the 
rays to pass through an electric or magnetic field is of prime impor- 
tance in obtaining a knowledge of the nature of these rays. The direc- 
tion of deviation is such as to indicate that tbs rays are composed 
of negatively charged particles. We obtain the same rays if we use 
as a cathode a wire filament heated to incandescence by an electric 
current. Moreover, with this cathode, we still obtain the rays when 
the tube has been pumped down to pressures at which it would no 
longer pass a currant with a cold cathode. This method of generating 
cathode rays is to be preferred, as we shall see presently, and we shall 
assume such an. arrangement to be used in the experiments discussed 
below. I>et the anode be a plane metal plate having a small hole in it 
and placed opposite the cathode. The greater part of the rays from 
the cathode strikes the anode. A current-measuring instrument oon- 

* Naturally, on account of unavoidable errors of measurement, only small multiples 
oaa bo demonstrated. 
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neoted between the electrodes indicates the passage of a current; thus 
the cathode rays transport electric charges. Let the narrow beam which 
passes through the hole in the anode strike a fluorescent screen some 
distance away. The path of the ray between anode and screen may 
be influenced by applying electric or magnetic fields. In analogy with 
electrolysis, we assume the cathode rays to consist of particles of mass 
m (still undetermined) and charge e. We note at once one important 
difference between the two phenomena: if we reverse the potential 
applied to the tube no current flows. The particles effecting the trans- 
port of electricity in this instance ar6 all of one charge, negative. 

We can calculate at once the velocity of the particles from the con- 
servation of energy: 

eV~%mv 0 *, (4) 

The values of e and m are still unknown; we seek to obtain the neces- 
sary relationships from experiments on electrical and magnetic de- 
flection of the particles. The deflection by an electric field may be 
calculated very simply. Let the field be normal to the original direc- 
tion of motion, which we take as the *-axis of a rectangular system of 
co-ordinates. Each particle is then subject to a constant force eE 
in the y-direction. The problem is thus identical with the computation 
of the trajectory of a particle thrown horizontally in a gravitational 
field. Starting with 

mx== 0 i 

my = eEj' (5) 

we integrate twice and apply the initial conditions, obtaining 

* = V ) 

= eEfi [ (6) 

y 2 m J 


Eliminating the- time, the Jrajectory is found to be the parabola 


The electrical deviation after passing a distance a through the field 
of strength E is then 


eEa 2 __ Ea a 
2mw 0 * 47’ 


(8) 


i.e. by inserting ther value of v 0 from (4'), ejm drops out of the equation. 
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Hence the electrical deflection alone gives only the potential F, which 
is directly measurable in our case. 

_ We seek to obtain a further equation from the magnetic deflection. 
In order to simplify the arrangement we apply the field along the 
z-axis in a direction normal to the initial direction of motion of the 
particles. On p. 309 we calculated the force exerted by a magnetic field 
on an element of a conductor carrying a current in a vacuum to be 


But a volume element containing charge of density p and moving with 
velocity v is equivalent to a portion of a convection current of density 
pv (cf. p. 418). The charge e is equivalent to pdr, so the force on the 
element of current becomes 


F = £[i>H]dT = ?OH]. 


It follows that the force is always normal to the vector v along the 
orbit, hence the magnetic field can do no work: it can change the 
direction of v but not its magnitude, so that the kinetic energy is 
not changed. Further, since F is also normal to H, no acceleration 
occurs along the z-axis. Thus the z-eomponent of the velocity remains 
zero if — as we assume — it was zero when the particle entered the 
field. On the other hand, the field imparts a ^-component of velocity 
to the particle. We have 


so that the component forms of the equation of morion are 


Integrating once and applying the initial condition i/hat the velocity 
has the direction of the x-axis and the magnitude v 0 when x = y = 0, 
we obtain dx e „ ^ } 

. ( 12 ) 

i=~ic Hx f 

Squaring and adding these two equations and noticing that the mag- 
nitude of the velocity is constant, we obtain the following equation 
of the trajectory: „ 
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This is the equation of a circle touching 
of radius mcv n 

p ~~eH‘ 


the a;-axis at the origin and 


(14) 


The 'paths of charged particles are bent into circular arcs in a magnetic 
field normal to the initial direction of motion. These circles lie in a plane 
normal to the field and their radii are directly proportioned to the initial 
velocity and inversely proportional to the strength of the fidd. If the 
magnetic field is of limited extent such that only the segment b of the 
z-axis is in the field, and if H is not too great, y 2 may be neglected as 
compared with y in the equation of the path, and we find the magnetic 
deflection to- be given by 


L jt 

wsj m 


. . ( 16 ) 


h m = Hb * 

** 2 mcv 0 2c a /2'FV 

where v 0 is replaced by its value from equation (4') (p. 427). This 
approximation is equivalent to neglecting the change in the com- 
ponents of v caused by the field itself in the right members of ('ll), 
i.e. to setting dyjdt — 0 and dx/dt = u 0 . It is not possible to determine 
v 0 , e and m separately, since we have only two equations, but equation 
(4) (p. 427) and equation (15) above represent two equations for 
v n and e/m. The numerical value of e/m thus found is 


- = 5-273 X 10 17 e.s.u./gm. = 1-759 X 10 n coulomb/kg., 


i.e. 1840 times a3 great as the ratio of charge to mass for the lightest 
electrolytic ion, the H ion. Since we always find the same value of 
e/m regardless of whether we use a heated cathode or a cold one, and 
regardless of the nature of the residual gas in the tube, it is natural 
to interpret the result by identifying e with the magnitude of the 
electrolytic element of charge and thus conclude that the particles in 
the cathode rays are 1840 times smaller in mass than the hydrogen 
atom. This interpretation is the only logical one, for if we were to 
assume the particles to have a mass equal to that of the atom of resi- 
dual gas, it would be difficult to explain the fact that e/m is indepen- 
dent of the kind of gas. We have thus disclosed the existence of a new 
fundamental substance, the atom of negative electricity, which we call 
the electron. Using the value of e obtained from the Faraday equivalent 
charge and Avogadro’s constant, the mass of the electron is found to be 
m 0 = 9-11 X 10~ 28 gm. 

The mass of the hydrogen atom is 1-67 X 10~ 24 gm. 

It is a fact of greatest importance that more accurate measure- 
ments show the ratio e/m to be not exactly constant, but to decrease 
as the velocity of the particles increases. Kaufmann was the first to 
succeed ,in demonstrating this variability. He applied transverse 
electric and magnetic fields to a beam of electrons moving in the 
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same direction but with various velocities in such way that the electric 
and m&gnetio deviations were at right angles to each other. Take 
a co-ordinate system in the plane of the observing screen, the origin 
being at the point where the undeviated beam meets the screen, with 
the x- and y-axes in the directions of electric and magnetic deviation 
respectively. Then, according to equation (8) (p. 427) and equation 
(15) (p.429). 


eEa 2 eHb 2 



If we look upon v Q as a continuously variable parameter, these are the 
parametric equations of a curve traced upon the screen by the elec- 
tronic rays of various initial velocities. Elimination of v 0 yields 

2 mcPEa, 2 » 

*= - W * ’ 

a parabola passing through the origin and having as axis thu sc-axis. 
Even if the experimental curve does not extend quite to the origin, 
it is possible to determine the tangent at the origin (which corresponds 
to infinite velocity) by reversing the electric field. The result of the 
experiment was, however, that the tangent formed a small but un- 
mistakable angle with the y-axis. In addition to ejm, the equation of 
the curve contains only constants of the apparatus. The discrepancy, 
is, then, interpretable only by assuming that ejm variee, and in such 
way that it decreases as v 0 increases. Now the relativity theory pro- 
vides for an increase of inertial mass with increasing velocity accord- 
ing to the formula (cf. p. 254) 



while there is no- reason whatsoever for assuming any variation of e. 
As a matter of fact, Kanfmann’s results are in good accord with this 
formula. Abraham deduced a formula for the variability of mass, based 
upon the concept of a fixed ether, which gives slightly different de- 
partures. More exact experiments, however, gave results decidedly 
in favour of the relativity formula (see p. 478). The relationship be- 
tween potential drop V and velocity must also be modified from the 
relativistic point of view. The correct energy equation replacing 
equation (4) (p. 427), is 



tV. 


(16) 
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Finally, all e/m determinations must be reduced to zero velocity; this 
has already been done for the numerical values given above. 

Ex. 111. In deflection experiments the screen is generally not exactly at 
the end of the field, but there is a field-free space between it and the edge of the 
field. Show that one may treat the deflections observed on the screen as if the 
rayB were bent at the middle of the field by the angle which the tangent to their 
path at the end of the field makes with the original direction of motion. 

Ex. 112. Theory of the “ Magnetic Lens A slightly divergent beam of elec- 
trons emanates from a point source, the vertex angle of this cone of rays being 
2a. Assume ah the electrons to have the same speed u. Show that a magnetic 
field parallel to the axis of the cone will re-unite the rays approximately at a 
point a certain distance away from the source. 

3. Survey of the Possible Methods of Generating Carriers of Electricity 
in Gases. 

Prdiminary remarks concerning the structure of matter 
and the interaction of light and matter 

At the present time we are certain beyond doubt that the atom 
is a complicated configuration of electric charges. The entire mass 
of the atom, with the exception of the negligible mass of the outer 
electrons, is concentrated in the positively charged “ nucleus ” 
whose diameter — 10~ 12 cm. — is vanishingly small compared with that 
of the whole atom (kinetic theory radius 10 -8 cm.). This nucleus is 
surrounded by a shell of electrons which fill a space of about the size 
of the kinetic theory volume. In an electrically neutral atom the 
number of electrons outside the nucleus is equal to the resultant number 
of positive elementary charges in the nucleus, and this number is 
identical with the atomic number, i.e. the number giving the place of 
the element in the periodic table. An atom vMeh has lost one or more 
of its outer electrons as a result of some external influence represents 
a positive ion, while an atom with an extra electron is a negative ion. 
A system having several nuclei is called a molecule. A molecule, as 
well as an atom, may become either a positive or a negative ion. It 
is chiefly the halogens, which come just before the noble gases in the 
periodic system, which tend to form negative ions. These elements 
form the singly-charged anions F~, 01~, Br - , I~. 

In a metal in the solid or liquid state, some of the electrons are 
able to move freely between the atoms. The state of this “ electron 
gas ” may be compared with that of an actual gas dissolved in a solid 
or a liquid. 

To remove a single electron from an atom or from the surface of a 
metal requires a definite amount of energy which is characteristic of 
the particular substance. This work of escape is generally specified by 
giving the potential difference through which an electron 'must fall 
in order to acquire this amount of energy. The magnitude of this 
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ionizaiic/n potential for free atoms varies from 3-88 volts for aesium to 
24-48 volts for helium. Once an electron has attained the critical speed 
it is able to dislodge an electron from the electron shell of an atom 
with which it collides. This happens relatively frequently as soon as 
the electrons have reached the critical speed, while the ionizing col- 
lisions of fast electrons are less frequent. The impact of a positive 
ion can also result in ionization of an atom, but the impinging particle 
must have a considerably greater speed, for according to the laws of 
impact, there is a considerable transfer of kinetic energy of trans- 
lation when two approximately equal masses collide. But even for 
speeds theoretically sufficient for ionization, the number of effective 
collisions is minimal. In contrast with electronic impact, the number 
does not increase until the velocities become greater. Instead of a 
positive ion, a neutral atom may impinge, and again ionization will 
occur tinder certain conditions. Since neutral atoms usually possess 
velocity as a result of thermal energy of agitation, this process, which 
is of importance at temperatures of 3000° C. and beyond, is called 
thermal ionization. There are thermodynamic formulae which make 
possible the calculation of the degree of ionization of a gas at a given 
temperature and pressure. 

Finally, radiation of short wave-length (<3000 A.) also causes 
ionization. In all - energy exchanges between atoms and radiation, light 
acts as if it were composed of discrete quanta of energy, of amount pro- 
portioned to the frequency of the light, which are transferred as units. 
The factor of proportionality is denoted by h and is Planck’s celebrated 
quantum of action, whose numerical value is 

k = 6-62 X 10~ 27 erg sec. 

In order that there be ionization by light — photo-ionization — the 
incident “ light quantum ” hv must be at least equal to the work of 
ionization. Calling the latter eV ( , we have 

hv 2; eV t (17) 

If we characterize the light by its frequency instead of by its wave- 
length, insertion of the numerical values yields the easily-remembered 
relationship between wave-length and equivalent electron energy 

X X v t = 12380. 

(In A.U.) (in voita) 

Atoms struck by electrons whose speed is not sufficient to cause 
ionization may nevertheless be made to emit radiation provided that 
the electrons have fallen through a potential exceeding a value called 
the excitation potential V 9 . Then equation (17) holds in reverse, i.e. 

hv<i eV (17') 
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All the agents here described can also dislodge electrons from the 
surface of a metal. Thus the impact of electrons or ions on a metallic 
surface releases “ secondary electrons ” ; light incident on a metal 
releases “photo-electrons”; heat causes “thermal emission of elec- 
trons The last-mentioned phenomenon is of particular interest. 
Just as an absorbed gas escapes from the absorbing body when heat 
is. applied, electrons may be “ evaporated ” from a metal by heating 
it to around 2000° C. The emission of electrons takes place at a lower 
temperature (600° C.) if the metal is coated with a thin layer of alka- 
line earth oxides. 

For completeness we must mention the possibility of the for- 
mation of ions in the course of chemical reactions. An example is the 
ionization of air by slow oxidation of phosphorus or other substances. 

4. The Separately-sustained Electrical Discharge. Spark Discharge. 

A self-sustaining discharge is one in which the passage of the cur- 
rent itself continues to produce the necessary carriers of the charge, 
while a discharge in winch these agents are produced entirely or in 
part by an external source of ionization, so that the discharge ceases 
when the source is turned off, may be referred to as separately sustained. 

The simplest example of a separately-sustained discharge is pure 
electronic conduction in a high vacuum. The electrons are liberated 
at the cathode either by radiation (photoelectric cell) or by heat, and 
move to the anode under the influence of the applied field. The maxi- 
mum value of the current is attained when all the electrons liberated 
from the cathode each second reach the anode. This current, which 
cannot be increased by raising the potential difference between anode 
and cathode, is called the 
saturation current. On the 
other hand, for small nega- 
tive values of the potential 
difference between the cold 
electrode and the hot one, a 
very few high-speed electrons 
will succeed in crossing the 
tube against the field. The 
voltage-current curve must 
therefore have the general appearance shown in fig. 1. The rigorous 
derivation for the linear case (extended parallel plate electrodes) was 
given by Gans. The theory is somewhat complicated, since it is not 
correct to assume a linear variation of potential from cathode to anode 
(which would, of course, be valid if the cathode were cold) on account 

* The work of escape from a metal is less than that required to free an electron from 
the atom. 




434 


THE THEORY OF ELECTRICITY 


[Chap. 

of the distortion of the field by the cloud of electrons surrounding the 
hot cathode. The, fact that no appreciable current passes when the 
potential difference is negative permits the use of the hot-cathode 
tube as an electrical valve (rectifier). If a wire grating or grid through 
which most of the electrons pass is interposed between anode and 
cathode, the electron current arriving at the anode depends very 
strongly on any difference of potential applied between the grid and 
the cathode. As a result, even a small alteration in grid potential 
causes the anode current to change considerably. This property — 
the ability of the tube to act as a regulator and amplifier — makes the 
three-electrode tube one of the most important devices in wireless 
telegraphy and telephony. - 1 

We now consider the case in which the gas between the electrodes 
is artificially ionized, say by X-rays, while nothing unusual is assumed 
to occur at the electrodes. In this case, too, we may expect a satura- 
tion current, viz. when all the ions formed in the gas in unit time reach 
the electrodes without previously recombining with oppositely- 
charged ions. If q ion pairs are formed in each cubic centimetre per 
second, and if the electrodes are l centimetres apart, the density of 
the saturation current is i, — qle. This seems paradoxical at first sight: 
the greater the separation of the electrodes, the greater the current. 
It is to be remembered, however, that this result refers to the highest 
attainable current, and that the potential must be raised when the 
electrode separation is increased in order to maintain the saturation 
current. 

We find experimentally that a new current increase occurs when 
the potential is raised further. This is due to the formation of new 
secondary ion pairs by the primary ions and electrons. Instead of 
treating this somewhat more complicated case, we shall investigate 
the simpler instance in which the primary electrons are formed at the 
cathode only, rather than in the gas. By means of light or some other 
agent, let electrons be liberated from each square centimetre of 
the cathode every second. Then the saturation current will have 
the density i, = w 0 e. Let eaoh electron form a. secondary ion pairs 
per centimetre of its path. Assume the velocity of the positive ions 
to be insufficient for ionization. If, in addition to the n 0 primary 
electrons, n t (x) secondary electrons pass through a cross-section at x 
each second — a total of n( x) electrons — then between x and x dx 

dn -- nadx (18) 

additional electrons are released each second. Integrating between 
x s=s 0 and x = l, we obtain the total number of electrons arriving at 
the anode 


= n 0 eF l i.e. * == en 0 e at . 


• ( 19 ) 
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This same value of the current density is equal to the total charge 
on the positive secondary ions arriving at the cathode plus that of the 
departing negative primary electrons. The form of equation (19) 
shows that the photoelectric current may he increased enormously 
by filling the cell with gas. 

We observe no sharp limiting current because there is a second 
possibility of ionization when the potential is increased, viz. ionization 
by collision of positive ions with atoms. Assume each positive ion to 
form 0 ion pairs per centimetre of path. Let w 0 primary electrons be 
released from each square centimetre of the cathode per second by 
some external agent, and let the total number of electrons arriving at 
the anode be %. Since no positive ions are liberated at the anode, 

is also the measure of the total current, which, at the cathode, 
consists of the electron current plus the ion current. Considering again 
a thin layer between x and x-\- dx (fig. 2 ), 
formed to the left of x per second and — 
neglecting infinitesimals — -(*4 — n) pairs 
formed on the right side. For a satura- 
tion current at equilibrium we must also 
have n electrons crossing the left face of 
the layer from the left, per second, and 
{% — n) positive ions crossing the right 
surface from the right. Hence the number ^ 

of electrons released on the path dx per second amounts to 

dn = [no + («i — n)ff]dx ~ [n (a — P) + ( 20 ) 

whence, by integration, 

w(o- £) + n?jS= .... ( 21 ) 


are n pairs of ions 
, , *», 
2*4 Be*. 

t<£ci 


We must have n — n 0 when x = 0 , so that 

(7=» 0 (a- fi) + n 1 p. (22) 

The single constant of integration is thus determined^, T3*e boundary 
condition for x = l leads to an equation for n^: 

- 0 + ihJ8 - K (<* - 0 + *»iffl . (23) 


Solved for tij, this is 




0 _^ e r.-aiT-- 


.... (23') 


It is interesting to notice that the denominator becomes zero if 

a = /3«<— or 1=^^. . . . (24) 

In this case even a very small value of n# is accompanied by a very 
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large (mathematically infinite) value of %. Then the function of the 
priinary electrons is only to initiate the process known as electrical 
brealcdown or spark discharge. 

The ionization coefficients a and ft will be proportional to the 
number of collisions on unit path and also to the kinetic energy acquired 
between two collisions. Since the number of collisions is proportional to 
the pressure, while the energy acquired on the free path is proportional 
to the ratio of the field strength E to the pressure p* we can write 

aOCP<f, (f)’ PoCp *(ji) {25) 

Remembering that E = V/l for plane electrodes, and designating by 
q a factor of proportionality which does not contain either V or p or 
l, equation (24) yields 

lp—Q 

In place of the ionization coefficients we now have functions of F, 
l and p in the relationship for the spark discharge. Considering this 
an equation for the spark potential F, we see that for plane elec- 
trodes the spark potential is proportional to the product of electrode separa- 
tion and gas pressure, i.e. to the amount of gas between the electrodes. 
This law' which was discovered empirically by Paschen, holds also 
for spherical electrodes if the radii of the spheres are increased in the 
same ratio as their distance apart. 

Paschen’s law is experimentally well substantiated, but the starting 
mechanism assumed here is not the only one that leads to this 
result. In fact, according to present knowledge, ionization by positive 
ions plays only a minor role, while indirect release of electrons from the 
metal of the cathode is much more important. The latter process 
comes about in this way: An electron not only causes ionization but 
— even before it has enough energy to do this — it excites the emission 
of fight from atoms with which it collides. This radiation releases 
electrons from the cathode photoelectrically, since the work of escape 
from the metal is less than that from free atoms. The condition for 
initiation of the discharge is that each primary electron shall liberate 
at least one electron from the cathode. Let y be the average number of 
electrons released indirectly from the cathode by one electron. Then, 
since the number of secondary electrons liberated in a distance l by a 
single primary electron is, by the above, e al — 1, we must require that 
y(e° * - 1) 1. (27) 

* For the energy is proportional to the product of field strength by free path, and 
the latter is inversely proportional to the pressure. 


, tiV/lp) 

log wm 

WM-uv/ip)' 


(26) 
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Since the same thing is true for the excitation of atoms to produce 
radiation as was said above for the coefficients a and /J, we see that y, 
too, is independent of F/Ip and Paschen’s law holds for this starting 
mechanism also. 

This assumed process seems to be valid up to pressures of about 
200 mm. of mercury. Cloud chamber photographs show that at higher 
pressures the discharge is initiated by “ canals ” which propagate 
themselves with the aid of irregularities in the field produced by the 
space charge due to positive ions (see next section). 


5. Self-maintaining Discharge: Glow and Arc Discharges. 

In the foregoing calculations the distortion of the field by space 
charge wa3 neglected in examining the initiation of discharge. Whether 
this is justified seems doubtful at the present time (see preceding 



1. Aston dark space 

2. ' Cathode sheath 

3. Crookes dark space 

4. Negative glow 

5. Faraday dark space 

6. Positive column 


section). At any rate, the steady forms of discharge, the main types 
of which are the glow discharge and the arc, are characterized by the 
fact that, on account of the large space charge, the field changes con- 
siderably after the discharge begins. In the present section we shall 
examine the properties of a stationary discharge without entering into 
a consideration of the very difficult question of what circumstances 
cause a disruptive discharge to pass over into a steady one. 

Fig. 3 represents the main features of the potential variation in a 
“ glow discharge ”, which occurs when low currents traverse a tube 
filled with gas at low pressure. The- various segments of the potential 
curve manifest themselves in part by different regions of luminosity. 
The nomenclature is given in fig. 3. The explanation, however, is very 
complicated, and will not be attempted here. The rapid change of 
potential at the cathode is called the cathode drop. Quantitatively, this 
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is defined as the difference in potential between the cathode and the 
point of minimum field strength, i.e. least potential gradient. Thia 
potential difference is independent of the current strength and of the 
gas pressure and depends only upon the nature of the gas and of the 
metal of the cathode, provided the current remains below a certain 
value. This limiting condition may be recognized visually by the fact 
that the cathode is just covered by the negative glow at that stage. 
The order of magnitude of the normal cathode drop is 200 volts. The 
cathode drop 'is the vital feature of a glow discharge; it corresponds to 
approximately the lowest potential with which a discharge can be 
maintained. What, then, is the source of the great lack of symmetry 
between cathode and anode? It is based upon the difference between 
the electron and the positive ion; this difference is responsible for 
quite different conditions for positive and negative space charges. On 
account of the relationship 

eV = eElf = Itwp 2 (28) 

(l f — mean free path), an electron attains a velocity \/l840A times 
as large as a positive ion of atomic weight A in falling through the 
same potential difference. Moreover, on account of the vanishingly 
small size of the electron, its free path l f is greater than that of the 
positive ion. Thus the electron engages in fewer collisions and so can 
accumulate more energy. According to the kinetic theory of gases, 
the mean free path of an atom of radius r 0 projected into a cluster 
of stationary atoms of radius r x , of which the number per cubic centi- 
metre is n v is given by 

fr-SSTTTO « 


If these atoms are also in motion with velocities approaching that of 
the impinging atom, the factor 1/V2 must be supplied in the right 
member. The velocity attained by the ions is of the order of magni- 
tude of the gas-kinetic velocity, so that the second assumption is valid 
for them ; the electrons, on the other hand, move" so much faster that 
we can consider the gas atoms relatively at rest. Hence for the elec- 
tron (r 0 = 0) the mean free path is 



while for an ion, 

It = 1 =. 


(30) 

(31) 


In a given field the velocity of the electrons is to that of the ions as 
V1840A-4V2 ; 1, or as 102VZ:1. 
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The region of the discharge which is free of space charge is charac- 
terized by linear potential rise, for Poisson’s equation (p. 268) de- 
mands that cPVjda 2 vanish where there is no space oharge. Hence, 
at such places. 


i — i+ + i- — en + v + + erajw_, 

so that 

5= = — = 102-y/Z. (32) 

» + v + 

In the region of no space charge, i.e. essentially in the positive 
column, the portion of the current carried by the electrons is at least 
one hundred times that carried by the positive ions. 

If we now assume for simplicity that the current at the cathode is 
carried only by positive ions, that at the anode by electrons alone, 
and that the field is the same on both sides, then — since * — Nev — 
th e rat io of the space charges Ne before cathode and anode is at least 
Vl02 A : 1. Therefore, on account of Poisson’s equation 
/72F 

-fas — — = —4 irNe (33) 

the potential curve must turn sharply downward at the cathode and 
slightly upward at the anode. Naturally, these considerations are only 
qualitative; in reality, part of the current at the cathode is carried 
by electrons dislodged from the metal by the impact of ions which 
have attained considerable speed in traversing the cathode fall. Pre- 
cisely this liberation of electrons is necessary for the maintenance of 
the discharge. 

In the absence of an external resistance for limiting the current, 
the glow discharge can go over into the arc form, in which the current 
is several hundred times greater. The arc is distinguished from the 
glow by the fact that the heating of the cathode causes it to release a 
much greater electron current that would be furnished by positive 
ion impact alone. Thus the arc type of discharge presumes the use 
of a hot cathode.* 

In many arcs the temperature of the gas is so high that thermal 
ionization is of considerable importance in addition to ionization by 
electron impact. It follows from these remarks that the number of 
charge-bearing particles increases considerably as the current is made 
greater, since both the thermal emission of the cathode and the 

* There are certain forma of electric are whioh do not employ thermal emission. 
In these the electron emission takes place under the influence of very strong eleotrio 
fields. 
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thermal ionization of the gas increase exponentially with the tempera- 
ture. This causes the arc to have a so-called “ falling characteristic 
the resulting potential difference V across the arc and the current 1 
are related by an equation of the form 

V — a + j, (34) 

which is the exact opposite of the ohmic relationship V = IR. 



A conductor having a falling characteristic V — f(I) must not be connect c 
directly to a source of potential, as may be seen by considering fig. 4. If, through 
some cause, the current should increase, the E.M.F. opposing the battery potential 
diminishes; the current then increases, causing the potential to drop further, 
resulting in an additional increase in current. On the other hand, for a conductor 
with a rising characteristic (e.g. an ohmic resistance), the required potential 
rises along with the current, so that the current returns to the equilibrium value, 
since the battery is incapable of giving the required E.M.F. when the current 
increases. One must therefore make certain, by inserting a conductor having 
a rising characteristic, that the characteristic of the entire system is an increas- 
ing one. 

6. The Origin of Cathode, Canal, and Positive Bays. The Mass Spectro- 
graph. 

Having obtained a broad view of the phenomena of the glow dis- 
charge, we are able to understand the origin of the various rays which 
are found in a discharge tube under favourable conditions. It must 
be remembered that there is a strong field near the cathode, and that 
the positive ions attaima considerable velocity in this field. The cathode 
rays, recognizable at low pressures by their fluorescent effects, are 
a part of the electrons released from the cathode by positive ion unpact, 
viz. those which have traversed the entire potential fall without ap- 
preciable energy loss by collision. But in order to maintain the dis- 
charge, it is necessary that a certain number of the liberated electrons 
produce new electrons and ions by impact in the gas. 
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The positive ions which have attained high speeds in the cathode 
drop can be made visible in the form of rays by providing the cathode 
with small holes and making the space behind the cathode field-free. 
Part of the ions then pass through the holes in the form of “ canal 
rays ”, where they gradually lose their speed by impacts with the 
randomly-moving particles of the gas. The principal difference be- 
tween canal rays and cathode rays is that the positive ions consti- 
tuting the former are self-luminous, while the electrons are capable 
only of causing other substances to radiate. The Doppler Effect 
(p. 235) may be perceived in the light of the canal rays, as first recog- 
nized by J. Stark. As in the case of cathode rays, the velocity v and 
specifio oharge e/M may be determined by electric and magnetic de- 
flection. The results show — as may be expected — that the speeds are 
not alike, since the distances traversed by the various particles vary 
considerably. 

Rays of positive ions can be generated at the anode also if con- 
ditions are so arranged that a greater anode fall results, and if the 
anode is made of a material capable of emitting positive ions, e.g. a 
metallic salt. As mentioned above, the smallness of the anode fall is 
due to the high speed of the light electrons. If now the inertia of the 
negative charges is artificially increased by attaching them to atoms, 
the resulting field at the anode must be similar to that before the 
cathode. This loading of the electrons may be promoted by intro- 
ducing electronegative atoms, into the gas, i.e. such atoms as the 
halogens, which have a tendency to form negative ions. This is done 
in practice by using a halogen salt as anode; the cations of the salt 
pass off as positive rays while the halogen evaporates as a neutral atom. 

The determination of e/M for canal rays and for positive rays is 
of particular importance, for it makes possible the determination of 
the atomic weight of individual ions. It is assumed, of course, that 
the charge on the particle is known; this must consist of an integral 
number of elementary charges, and there is usually no doubt as to 
the exact number. We see from equation (8) (p. 427) and equation (15) 
(p.’ 429) that the electric deflection is the same for all particles which 
have traversed the same potential drop, while the magnetic deflection 
depends upon e/M as well as upon the potential difference V. This 
circumstance may be used to overcome the troublesome lack of homo- 
geneity of the canal ray particles which, of course, have fallen through 
various potential differences. By means of an electric field, the par- 
tioles are first sorted according to V ; then, by means of a slit S 2 (fig. 5), 
a narrow region containing only ions which have traversed the same 
potential drop is isolated; finally, by means of a magnetic field, the 
particles are sorted according to mass. This is the scheme of F. W. 
Aston’s “ mass spectrograph ”. The sharpness of separation depends 
upon the homogeneity of the rays, which, in turn, depends upon the 
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narrowness of the slit; this is limited by the fact that the intensity of 
the beam must not be made too low by undue reduction of the slit 
width. Aston’s idea was to focus the rays in such manner that rays 
within a given range of potential difference would be united at the 
photographic plate which serves to record them. Integration of equa- 



Fig.j 


tion (5) (p. 427) gives the angular deviation in an electric field of length 
o, provided the angle is small; 


_dy _ eEa _ Ea 
a ~fo~Mv 0 *~ 2P 


. (35) 


and the magnetic angular deviation in a field of length b is, by (15), 
p. 429, 




J M ‘ oy/2T 


(36) 


Call the distance between the mid-points of the electric and magnetic 
fields Zj and let the distance travelled by the rays from the middle of 
the magnetic field to the detector be l 2 . According to Ex. Ill (p. 431), 
we may consider the rays to come in a straight path from the centre 
of the field. The entire deflection, measured in a plane normal to the 
original direction of motion, is then given by 

h = (l x + k)* + kP- ..... (37) 


If it happens that the potential drop traversed by the rays is not 
strictly constant, but varies by an amount dV . then the angles of de- 
flection lie in an interval 


da 


dV 

V 


dY 
2 V' 


• (38) 


as may be seen by logarithmic differentiation of equations (36) and 
(36). This means that rays within this interval which have fallen 
through different potential differences but have the same e/M form a 
pencil of cross-section 




- ( 39 ) 
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This cross-section is reduced to zero by making the expression in 
brackets vanish. This can happen only when a and fi have opposite 
signs, i.e. when the magnetic deflection is opposite in direction to the 
electric. Set fi — — y. Since \ is fixed, and since a also has a fixed 
value for a given potential, the Only remaining variable is the. distance 
l 2 from the mid-point of the magnetic field to the point where the rays 
are received. The bracketed expression set equal to zero gives the 
equation of the focal curve in polar co-ordinates 1%, y — i.e. the curve 
on which the pencils corresponding to various values of eJM have 
zero cross-section. This equation is 



The polar axis is the direction of the magnetically undeviated ray 
(see fig. 5). If we choose a new axis making an angle 2a with this one, 
the angle measured from the new axis is 8 = y — 2a, and tie equation 
of the focal curve takes the simple form 

l t e = l 2 sm.e = 2l l a (41) 

This is, in fact, a straight line parallel to the new polar axis and dis- 
tant from it 2aLy. This line is the intersection of the photographic 
plate with the plane of the figure when the plate is properly placed to 
receive the rays of a given e]M in sharp focus. The magnetic field has 
a dual function: it separates the rays corresponding to various masses 
and unites (focusses) the rays corresponding to particles having the 
same mass but originating at different points in the cathode drop. 

This “ classical ” example of a mass spectrograph, since surpassed 
by other designs, is one of the first instances of the use of electron optics. 
The term “ focussing ” has been current ever since. Electron optics is 
based on the formal correspondence between light paths in various 
refracting media and particle orbits in fields of force (cfT p. 787 et seq.). 
Following this analogy through, individual fields of force may be com- 
puted as “lenses” (as in Ex. 112, p. 431), which are the basic elements 
of electron-optical instruments such as the well-known electron micro- 
scope. 
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The Fundamental Ideas of the Theory of 
Metallic Conduction 

I. Electrons as Carriers of Current in Metals. 

There are only a few solids which, like Cul, exhibit good electro- 
lytic conductivity. The current in such substances is carried by 
material ions, as may be seen by the deposits formed at the electrodes. 
The conductivity of that class of substances known as metals is many 
orders of magnitude greater — in fact, an essential characteristic of 
a metal is high electric conductivity unaccompanied by any transport 
of matter. It is therefore likely that the current is carried by elec- 
trons which pass through the meshes of the atomic or ionic lattice. 
From the validity of Ohm’s Law for metals we conclude, as in the case 
of electrolytes, that the reaction between the electrons and the lattice 
particles may be looked upon as a frictional foreff rv proportional to 
the velocity, so that the relationship 

— eE = rv . . (1) 

is again valid.* The direct proof that the current is carried by electrons 
in metals was given by Tolman in his determination of the value of 
e/m for the conduction electrons. Since an electron has inertial mass, 
this ratio must be determined from experiments in which the inertia 
of the electrons manifests itself. One might imagine, for example, an 
arrangement in which a metallic conductor is freely susuended. If 
the conductor is at rest while a current is flowing, and if the current 
is suddenly interrupted, the electrons continue to possess a certain 
amount of momentum which is destroyed by friction within the body. 
According to the principle bf the Conservation of Momentum, the body 
as a whole must then experience a displacement. The opposite effect 
shows more promise of verification, for practical reasons: mechanical 
acceleration of the conductor should cause relative motion of the 
electrons, and therefore a current. Since we have very sensitive in- 
struments for measuring feeble currents, this method is easier to 


1 The negative sign arises from the negative charge of the electron. 
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apply; Tolman actually determined ejm in this way (1914-23). We 
examine the case of an infinite straight cylindrical conductor which 
experiences a constant acceleration a in the direction of its length. 
According to p. 231, each electron is subject to an inertial force 
— ma, and under the influence of this force and friction it attains 
a velocity 


ma 



( 2 ) 


If there are N (negative) electrons per cubic centimetre, the density of 
the resulting current will be 

• . Nem 

* = -f — a. ....... (3) 


Now, on account of the relationship 


since we have always 
we find 


i = —Nev, 


Ne 2 E 

r , 

.... (4) 

i = oE, . . . 

.... (5) 

c- Ne * 

r 

.... (8) 


Hence the current which results from the acceleration is given by 
the equations 


Assume now that in place of a straight conductor we have one of 
circular form that we accelerate by oscillating it around an -axis norma) 
to the plane of the circle. While it is true that the acceleration is no 
longer constant, (7) may be applied at any instant, understanding a 
to be the instantaneous acceleration. The direction of a, and hence of 
i, is understood to be along the tangent. We then have circular cur- 
rents, and the oscillation results in a magnetic field which varies with 
time. Its induction effects on a secondary coil may be computed by 
the results of p. 312, exactly as for a normally produced alternating 
current. Since (7) contains ejm as the only unknown, Tolman was able 
co calculate its numerical value. Within the limits of error, it agreed 
with that for free electrons. 

The ‘next question is: “ What is the state of the conduction elec- 
trons in metals ?” The simplest and most fruitful assumption is that the 
electrons move about through the framework of atoms, like atoms of 
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a gas. We lave reason to believe that the number of free electrons is 
of the same order of magnitude as the number of atoms, so that the 
obstacles between which the electrons swarm are atoms with positive 
charges. This concept of an electron gas was used in Chap. XXIII, 
§ 3 (p. 431) to obtain a picture of the process of thermal emission 
of electrons. With this concept as a basis, we shall endeavour to 
obtain more information about the formally introduced frictional 
constant r. 


2. Derivation of Ohm’s Law for Metals. 

As a result of the heat content of the metal, the conduction elec- 
trons within the metal are in a state of chaotic thermal agitation, 
exactly as if they were atoms of an ordinary gas. Let the mean velocity 
of this motion be v w . For simplicity ip the calculations we deal with 
a single average velocity, while actually the velocities have a certain 
distribution about this mean value, as in the case of a gas. If an ex- 
ternal electric field is applied, a component which is along the field 
and which is the same for all electrons is added to the disorderly 
motion. The electron will not continue to take energy from the field 
indefinitely, but will soon lose the acquired kinetic energy by collision 
with an atom. We shall refer to a collision in which an electron gives 
up all its acquired energy to an atom as an effective collision. In 
addition, there may be elastic collisions which we need not consider. 
We call the distance traversed between two successive inelastic impacts 
the mean free path l. As long as we may regard the additional velocity 
imparted by the field as small compared with the thermal motion, 
the time between two inelastic impacts is given by 



By Newton's second law, the velocity attained in the field after r sec is 



Hence the average directed velocity component between two col- 
lisions is 


eJB'r eJSl 

2 m ~ 2v u m‘ 


m 


On account of the complete disorder of the thermal motion this effects, 
on the average, no transport of charge. In computing the current 
density, then, only the additional velocity acquired in the field need 
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be taken into account. If there are N free electrons per cubic centi- 
metre, the magnitude of the current desity is 


= — Nov, — ^ — — = oE. 


( 11 ) 


Since none of the factors composing a depends upon Lie field E, we 
have here the proportionality between current density and field 
strength, which is equivalent to Ohm’s Law. 

With regard to the temperature dependence of N, l and v w , we may 
say the following: According to our ideas concerning the structure of 
metals, N must in. any case be temperature-dependent. Further, con- 
sidering the electrons to form a monatomic gas, we have, according to 
the statistical law of equipartition (c£. p. 696), on the average, at a 
temperature T 

(12) 


where k is the Boltzmann constant, or “ gas constant per molecule ”, 
E/L. Thus increases with s/T. If we make use of the experimental 
fact that the conductivity of all pure metals is inversely proportional 
to the absolute temperature, we can conclude that l decreases with 
1 IV'T, so that the product lv w is independent of the temperature. 
Qualitatively, we can see at once that the temperature affects each of 
these quantities oppositely: an increase in temperature causes the 
lattice elements to vibrate with greater amplitude. This increases the 
number of collisions between electrons and atoms, causing a reduction 
of the mean free path. On the other hand, the mean thermal velocity 
increases. However, according to the newer theory (cf. next section), 
v w must be essentially independent of temperature and l must decrease 
as 1/2'. 


3. Conduction of Heat in Metals; the Law of Wiedemann and Franz. 

The fact that the thermal conductivity of metals is many orders 
of magnitude greater than that of solid dielectrics suggests that this 
property, too, is determined by the free electrons. If this is so, we can 
compute the thermal conductivity of metals on the same basis of an 
electron gas and bring it into relation with the electrical conductivity. 
For the sake of 'simplicity, we again ignore the statistical distribution 
of velocities and of free paths and use the single magnitude v, corre- 
sponding to the quantities denoted by above, and the single value 
Z. On the other hand, we are not justified in assuming that the com- 
ponents of v are equal for all electrons. Assume a temperature gradient 
in the direction of the z-axis. Consider the electrons passing in both 
.directions through a cross-section at a height z. We assume that the 
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energy of these electrons corresponds to that of the layer in -which 
they experienced their last collision. If we denote the energy of an 
electron at height %, by and call the temperature ,T 0> then 

u (z) — « 0 -f ^jjj^s “ i kT o + 5 * J z 2. - . (13) 

The electrons passing thrcngh each square centimetre of the section 
z,j every second transport a hotel amount of energy 

s - s ”V-.[“.+ (|);] 

where n Zt is the number of electrons per cubic centimetre which have 
the ^-component of velocity v t . The summation is to be extended 
over all % which occur, and over all possible, layers *. These, however, 
are not independent. If we have uniform velocity v and if we replace 
the distance between the plane z = 0 and the place of the last col- 
lision by the .mean free path, we have 

cos(/fco) — tend— . . . , (.15) 


Again, the expression vanishes, otherwise there would bo an 

electric current — e£»„ v x flowing through the section z— z p . The 
thermal energy transported each second is 


a hi„v t* dT __ 
2 l l,z 


Now the spatial average of cos 2 9 is 1/3: thus the energy flux is given by 

...... (IT) 


The thermal conductivity is defined by 



o„ --= }¥Nh. 


Here again we do not know the manner in which the separate factors 

* It weai at first remarkable that wt may put the mean free path I tor the distance 
between the planes r, — % and * — z w Bines all tie electrons do not experience their 
next oolhdon in the -plane a » v This will bo dhecaed in more detail on p. 670, 
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depend upon the temperature. However, if we form the ratio to the 
electrical conductivity, a simple expression results: 

2 r. . (19) 

This expresses the law of Wiedemann-Franz, which states that the 
ratio of thermal to electrical conductivity is proportional to the absolute 
temperature. This law is well substantiated experimentally, and the 
factor of proportionality agrees, within certain limits, with the value 
3 {RjFf. 

4. Objections to the Theory Developed Above. The Electron Theory of 
Pauli and Sommerfeld. 

Although the concept of an electron gas accounts for many pheno- 
mena of metallic conduction, the fact that some of its consequences 
lead to gross contradictions of experience must not be overlooked. 
Primarily, there is the question of the contribution of the electrons to 
the energy content, and hence to the specific heat, of the metal. 
According to the Law of Equipartition of Energy (p. 596), there 
corresponds to every degree of freedom which contributes to the 
energy of a system an amount of energy hTJ 2 per molecule. Assuming 
as above that the number of free electrons is of the same order of 
magnitude as the number of atoms, the energy content of the metal 
must be increased by (3/2)5T per inol, corresponding to the three 
components of the translatory motion of the electrons. Then the 
specific heat per mol. 



does not have the value 65/2 found for higher temperatures (Law of 
Dulong and Petit), but amounts to 95/2, which is completely at 
variance with experience. This was first cleared up by recognizing 
that the electron gas has an enormous number of particles per unit 
volume — one electron per atom, or as many individuals per unit 
volume as there are atoms of the solid metal. Thus it is not legitimate 
to treat an electron gas by ordinary statistics. The theory developed 
by Pauli and Sommerfeld will be discussed later (Chap. XXXVIII). 





CHAPTER XXV 


Electron Theory op the Dielectric Constant, 

Index op Refraction and Magnetic Permeability 

I. The Origin of Electrical and Magnetic Polarization. 

While the dielectric constant and the permeability (or the electric 
displacement vector D and the magnetic induction vector B) are the 
quantities used in the field theory, they have no immediate physical 
interpretation in the atomistic theory of electrical phenomena. It is 
the polarization, i.e. the resultant dipole moment per cubic centimetre, 
whose interpretation can be given by the atomistic theory. The 
polarization P — in the magnetic case, the magnetization M— -depends 
upon D and E or upon B and H according to the relationships given on 
pp. 280, 298: 

D = E + 4tcP, B==H + 4vM. 

The polarization vector is parallel to the field vector for isotropic 
bodies; for anisotropic substances (crystals not belonging to the 
regular system) P is a linear vector function of E. We restrict our 
present considerations to isotropic substances, for which the con- 
nexion between P and E or between M and H is given by a simple 
scalar magnitude — the susceptibility k . 

A resultant dipcie moment can originate in two ways in either the 
electrical or the magnetic case. Firstly, the molecules of the sub- 
stance may themselves possess a moment. In this instance the axes 
of the dipoles are randomly oriented, so that there is no resultant 
moment. Application of an external field causes the particles to assume 
the direction of minimum energy, i.e. to set themselves parallel to the 
field. This adjustment, however, is opposed by the thermal agitation, 
so that the alignment can become perfect only for an infinitely strong 
field or when the temperature is reduced to absolute zero. It is evident 
that this kind of susceptibility depends strongly upon the temperature. 
In the magnetic case the term “ paramagnetic ” has been applied for 
a long time; in the electrical case the differentiation is less obvious (see 
below), and so this distinction has come to be applied only recently. 
The- corresponding term “ paraelectric ” or “ parelectric ” is there- 
fore not yet in universal use. 

480 
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The other possibility is that the dipolq moment of an individual 
molecule is induced by the field itself. This process is easy to compre- 
hend in the electrical case. The electronic shells are not rigidly attached 
to the core, so that an electric field causes a relative displacement of 
the two. On account of the fact that the charge on the electron is 
negative, the shells are moved in a direction opposite to the field, but 
since the dipole moment is defined as the limit of the product of charge 
by separation, multiplied by the unit vector drawn in the direction 
from the negative toward the positive charge, the moment itself has 
the direction of E. Hence the resulting “dielectric polarization” is 
in the direction of E. In the electrical case the dielectrio and par- 
electric polarizations have the same sign. However, since the dielectric 
polarization does not involve molecular orientation, it does not depend on 
the temperature. Dielectric polarization occurs also for parelectrio 
substances, since a relative displacement of positive and negative 
charges can occur here also. The distinction between the two forms 
will become clearer in the following paragraph. 

The origin of “ diamagnetism ” is somewhat more complicated. 
As shown on p. 664, an applied magnetic field imparts to the electrons 
an additional rotation about the direction of the field. This so-called 
Larmor precession takes place with angular velocity to = eE/2mo. 
This motion of the electrons gives rise to a magnetic moment which, 
as exact calculation shows, is in a direction opposite to the field. Thus 
the diamagnetic and paramagnetic susceptibilities are opposite in 
direction, and so the distinction between the two forms in the magnetic 
instance is the well-known one. The quantum theory requires that the 
paramagnetic moment of an atom be a multiple of a certain magnitude, 
the magneton; this is so large that for paramagnetic substanoes the 
diamagnetic susceptibility which is always present is relatively in- 
significant. 

2. Theory of Dielectric Polarization, Optical Index of Refraction and 
Dispersion. 

Assume that the substance under consideration contains N atoms- 
or molecules per cubic centimetre, these particles themselves having 
no dipole moment. Suppose further that the distances between par- 
ticles are so great that their mutual influences may be neglected. In 
a field E every particle assumes a moment, which we may take as pro- 
portional to the field, for we are dealing with small displacements of 
the electronic systems, and may take the restoring forces to be pro- 
portional to the displacements. These forces hold the system in equili- 
brium in the field; hence the elongation, and thus the dipole moment, 
is proportional to ths field strength. We then have 

£ = <*E, 


( 1 ) 
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and the polarization becomes 

P = kB = NaE (2) 

The dielectric constant is then given by 

K = « 2 = 1 + 4t rNa, (3) 


where the Maxwell relationship K — n 2 is assumed (p. 331). Inasmuch 
as we shall limit our present considerations to matter of very low 
density, we may replace (n s — I) by 2 (n — 1) and obtain an approxi- 
mate formula for the refractive index: 

» — 1 on 2 nNa (3') 

Since we assume a quasi-elastic restoring force, the application of an 
alternating field will cause displacements of the electron shells which 
depend upon the frequency. Assuming further that the polarization 
of each atom is caused by the displacement of an individual electron 
from its position of equilibrium, the equation of motion of the electron 
is that of a forced harmonic vibration: 

d 2 s 

m aJ +*•“—“*«**■*• (4) 


The stationary solution is, according to p. 97, 


eE 0 e** 
h - mo? 


(5) 


or introducing the natural frequency co 0 — VX/ni, 

, _ eE ^‘ 

m{<o 0 2 — « 2 )' 

Hence the dipole moment becomes 

and the polarizability is given by . 

_ e 2 jtn 

cc 0 2 — CJ 2 ' 


( 6 ) 

(7) 

(8) 


If the number of electrons is / instead of one, the value is / times at 
great and the refractive index is given by 

Mtf- >' 2 ) ' ‘ ( ) 


» — 1 ==• 


where <a has been replaced by 2irv. 

* The effect of the magnetic vector of the light wave, given by equation (9) (p.428 ) 
may be neglected, since the resulting electron velocities are small with respeot to e, 
while E and H are of the same order of magnitude in our system of measurement. 
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Even this simple dispersion formula contains all the essential 
properties of the optical index of refraction. The characteristic fre- 
quency v 0 is to be interpreted as the absorption frequency of the atom, 
since for this applied frequency the response of the atom is so great 
that all energy is taken from the primary ray and scattered in every 
direction. Now the absorption line® of free atoms lie almost ex- 
clusively in the ultra-violet region of the spectrum. Hence if we trace 
the course of the index in the visible region, v < v e , and so the re- 
fractive index is always greater than unity. As we pass to shorter 
Wave-lengths the frequency increases and we approach the charac- 
teristic frequency; the difference v 0 2 — v 2 becomes smaller and smaller 
and the index increases toward short waves. This is the usual be- 
haviour for glass and for liquids, and is called normal dispersion. 
Beyond the absorption band the index becomes less than unity accord- 
ing to equation (9), and so the rays in that region are refracted less than 
the longer ones. This reversal of the usual prismatic sequence of 
colours was first observed by Christiansen, using fuchsino, and is 
called anomalous dispersion, although from the point of view of the 
theory there is no anomaly whatsoever. 

The dispersion formula may be improved in several respects. 
Firstly, the fact that an atom or molecule may possess not one, but 
perhaps a large number of absorption lines must be taken into con- 
sideration. Each one contributes an oscillator whose strength is pro- 
portional to the intensity, so that we write 


, Ne*lm^ Si 
1 “ ~2W~ 


(10) 


Moreover, the oscillations of the quasi-elastically bound electrons 
are not undamped; the radiation resulting from the oscillation (see 
p. 341) involves a certain, damping which prevents the index from 
actually becoming infinite at the absorption lines, as in equation (10). 
If the damping Is taken into account, the index becomes complex and 
the substance behaves like a metal near the regions of absorption. 
We shall, not treat this case in more detail here. 

It is more important to take into consideration the mutual inter- 
actions of the dipoles, which cannot be neglected when we consider 
denser substances, such as liquids or solids. In such cases there is a 
microscopic field F at the place occupied by any. selected electron. 
This field is not the same as the macroscopic field E defined, for 
example, by the difference of potential across a condenser. But in 
any case the susceptibility is defined as the ratio o£ the polarization 
to the macroscopic field strength. The difference between E and F is 
that all the charges contribute to the macroscopic field, while in 
calculating the field E< acting upon the dipole under consideration, the 
latter is not to be included (of. footnote, p. 266). In order to calculate 
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the field of neighbouring dipoles, which is added to the applied field, 
we circumscribe a small sphere about the point occupied by the atom 
under consideration and examine first the contribution of the dipoles 
lying outside the sphere — i,e. we imagine all material removed from 
within the sphere. There is a surface charge upon the sphere, and the 
additional field has the value +4 ttP/ 3, according to p. 283, where we 
had the same surface charge, but with opposite sign (see also Ex. 75, 
p. 284). According to a calculation made by Lorentz, the contribution 
of the particles within the sphere vanishes either when there is com- 
plete absence of order or when the atom under observation is a point 
of a regular crystal lattice, i.e. for all isotropic bodies. Then the dipole 
moment of the individual atom or molecule is 


•p — a^E -f- —■ ?), ..... (11) 

and we obtain the equation 

P=tfa^E+~P^ ...... (12) 


for the polarization. Solving this equation for P yields 
NaE 


(13) 


Solving for Na shows that the simple relationship between N, 
K (or n 2 ) given by equation (3) (p. 452) is to be replaced by 
4w N K- 1 n 2 - 1 

3 J * a ~K + 2 n 2 +2' * * * * 


(14) 


Hence for denser substances the dispersion formula is 
n 2 - 1 Ne 2 jm ^ f t 
n 2 + 2 3 tt * 


(15) 


As may be verified immediately, this formula becomes identical with 
equation (3) for n an 1. It is customary to deal with one mol of the 
substance, rather than with 1 c.c. Let the density be p and the mole- 
cular weight M. Then 1 gm. contains LjM molecules, and 1 c.c. con- 
tains 


<“> 


Using this expression in equation (15), we obtain 

n 2 - 1 M Le 2 /m /, 

n s + 2 ° p 3tt ‘ v 0 8 — i' 2 ' 


. . ( 17 ) 



XXV.] ELECTRON THEORY OF PHYSICAL CONSTANTS 455 


The quantity on the left, which has the dimensions of a volume, is 
called the molecular refraction. 

Extrapolation of. equation (15) to infinitely long waves, i.e. to 
i» = 0, gives the following equation for the static value of the dielectric 
constant: 


K- 1 Nfjm y 
E+2~ 3rr ** v 0 ,r 


{15') 


For many substances this does yield the true value, but for others — 
e.g. water — there are large discrepancies. The reason is that this 
method gives only the dielectric part of the polarization. As Will be 
seen in the next section, the pareleceric portion vanishes for optical 
frequencies, on account of the fact that the oharges are not able to 
follow the field at such frequencies. Thus substances for which (15') 
gives thd correct value of K are free of dipoles. 

Ex. 113. The refractivity of a substance of mass m and density p is, in anal- 
ogy with the molecular refraction given by the expresnioc 

»*-£ 2‘ p' 

Show that the refractivity of a mix ture is the sum of the refractivities of the 
components. 

Ex. 114. Formerly, the molecular refraction was often taken to be the actual 
volume occupied by the molecules. As a matter of fact, one obtains numerical 
values which agree well with the kinetic theory values. Show that this is so, 
for example, for water (?i -= 1-33). Show also that the numerical valves of polariz- 
ability are those of metallic spheres of the gas-kinetic size (of. p. 284 ). 

Ex. 115. Calculate phase velocity and group velocity of eleotro-magnetio 
waves in a medium containing N free electrons per unit volume (Heaviside Layer 
of the upper atmosphere). 


3. Parelectric Susceptibility. 

As explained in § 1 (p. 450), the application of a field to a substance 
whose molecules possess an electric moment generates polarization by 
lining up thebe dipoles. This phenomenon occurs only if the mole- 
cules are free to move — i.e. principally in gases and in liquids — and 
if the external field reverses slowly enough to permit the system to 
come to equilibrium between reversals. The parelectric contribution, 
which is responsible for the high value of the dielectric constant of 
water, vanishes at frequencies corresponding to wave-lengths of a few 
centimetres. In order to find the extent to which the dipoles are 
aligned in terms of the temperature and field strength we make use of 
the statistical theorem (p. 588) that the number of particles of a system 
which are in a state of energy u is proportional to 
e~ ull,T , 
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where k is the Boltzmann constant. Now the potential energy of a 
dipole in a field E is given by 

u= — fiE = —pE cos 6 (18) 

If we denote the direction of the dipole axis by a point on the unit 
sphere, the number of these points which lie in the zone 8 to 8 + d8 
(9 = co-latitude) is proportional both to the area of the zone and to 
e -'u/*r. 

We may therefore set 

dN=Cef Ea » 9 i kT sm8de (19) 

the factor 2ir being contained in C. The value of this constant is 
determined by integrating to obtain the total number N of particles 
per unit volume: 

N = cfee EcmSfl ’ iT Bm8d6. . . 

Make the substitution 

pE cos 9 

-w—”' 

then 

„ n kT r+PBIMT 2 CkT.,pE 


( 20 ) 


(21) 


• (22) 


pE J-pEIkT 

Solving for C and using this value in equation (19), we have 
dN — — — - e pE coa elkT sin 6d9^ 

If the axis of a dipole forms an angle 6 with the field, its contribution 
to the resultant parallel to the field is p cos 8. The components normal 
to the field cancel out in summing over all dipoles, and we obtain the 
resulting dipole moment per cubic centimetre, i.e. the polarization, by 
integrating over all directions: 

p _ Np 2 E 


2 kT sinh- 


pEJ 
kT 

Making the same substitution a 
p _ NpkT r**™* 
2pEmnh** J -*M- 


f gpEeott/MT C( 


( 23 ) 


above, this becomes 

NpkT (we a — e")| 


+pE/i>t 

■PBIMT 


2 pE sinh ^ 
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• j 00 ** 1 w (s) 


I* . (24) 


Following Langevin, to whom this theory is due, we have designated the 
function coth x — Ijx by a single symbol, say L(x). We see that L(x) 
approaches unity for large values of x; a power series development 
shows that the function behaves like m/3 for small values of x. Since 
Np is the maximum value (saturation value) of the polarization, 
attained when all the dipoles are aligned with the field, the limit must 
be unity for large values of the argument if the result is to have any 
meaning at all. For weak fields and for temperatures which are not 
too low — conditions which are satisfied in most experiments — the 
approximation ;s/3 is sufficient and, since P is always in the direction 
of E, we obtain 

P=- 

That is, within certain limits, the. parelectric susceptibility ia also 
independent of the field, and we can apply the formula! of § 2 
if we replace a by p^j'SkT. Taking account of the interactions of the 
dipoles, equation (14) (p. 454) then gives for the parelectric part of 
the dielectric constant 


3 kT 


(25) 


K — _I __ inNp* 
K+ 2 ' ‘ 9 kT ’ 


(26) 


and likewise the molecular polarization 

K— 1 M__4 
K 4- 2‘ p ~ 9 kT' 


. . (27) 


In deducing the formula for the parelectric polarization we made 
the natural assumption that the dipoles could assume all possible 
orientations with respect to the field. According to the principles of 
the Quantum Theory, however, only a discrete set of angles of orien- • 
tation occurs. Nevertheless, the strict quantum theory calculatioa 
yields only small deviations from the classical Langevin formula. Of 
much greater importance is the “ directional quantization M in the 
magnetic case; this will be treated below. 

On account of the fact that the adjustment cannot take place 
freely, solids exhibit parelectric susceptibility only in the neighbour- 
hood of the melting-point. Nevertheless, crystals composed of Ions of 
different signs can show a different kind of polarization which is due 
to relative movement of the ions as entities. This polarization also 
vanishes at optical frequencies. 

Ex. H6. Given the dielectric constant K = 80 for T = 290° and the optical 
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index of refraction n = 1-33, calculate the dipole moment of the water molecule, 
assuming that the dielectric and parelectrio polarizations are additive. 

4. Paramagnetic, Ferromagnetic and Antiferromagnetic Susceptibility. 


Molecules are the only particles which can possess electric moments, 
since for all atoms the centre of gravity of the negative charges coincides 
with that of the positive charges. On the other hand, a free atom can 
have a magnetic moment. There is another point of difference between 
paramagnetism and parelectricity : from the fact that the paramagnetic 
susceptibility of salts does not change much when the salts go into 
solution we may conclude that the solid state offers no hindrance to the 
alignment of the dipoles. 

If we now assume that all orientations are equally allowable, we 
need only translate the formulse of § 3 (p. 457) into magnetic terms, 
with the understanding that p now represents the magnetic moment 
of the atom or molecule. Neglecting the interaction of the dipoles we 
have, according to equation (25) (p. 457), the following expression for 


The susceptibility per unit volume is then given by 


_Np* 

" ur 


(29) 


Division of k by the density p gives the susceptibility per gramme; if 
we then multiply by the molecular weight Cr we obtain the important 
molar susceptibility 


_ kG _ Ljjf M‘\ C 

p 3 JcT' ZRT r 


. . (30) 


Herd M represents the dipole moment per mol, i.e. the moment of one 
mol when the dipoles are all completely aligned; and G is called the 
Curie constant .after P. Curie, who found the temperature relationship 
given by (30)*- The experimental determination of 0 consists in 
finding the temperature variation of X s from this tli8 molar magnetic 
moment is found by thp equation f 


M = VZRG. 


(33) 


P. Weiss found J that the molar magnetic moments of a large number 
of substances determined in this way were all multiples of a certain 
value „ r ..... 

M w = 1125-5 e.m.u., 


* P. Curie, Journ. d. Phys., 4 , p. 197 (1895). 

t Although we now know that this evaluation is not applicable, in older to have 
convenient comparison between theory and experiment we quote the results in terms 
of “ effective moments ”, i.e. moments as given by (30). 

| P. Weiss, Oomples rendus, 152, pp. 78, 187 (1611). 
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now called the Weiss Magneton. Nevertheless, the limits of error in 
these experiments are quite wide. Remarkably enough, the Bohr 
theory of the atom also yields a magneton whose value referred to 
one mol is, within the limits of error, exactly five times the Weiss 
magneton. From the standpoint of the quantum theory— upon which 
Bohr’s theory is based — it is not at all to be expected that we obtain 
a whole number of Bohr magnetons when evaluating the observed 
Curie constants, for it is a characteristic of the quantum theory that 
only a certain set of orientations of the dipole axes with respect to the 
field is possible. This will be discussed in further detail in the seventh 
part of this work (p. 665). Here we investigate only the especially 
important case where a moment is allowed to set itself only exactly 
parallel or antiparallel to the field. The distribution is governed again 
by Boltzmann’s principle. If N + is' the number of moments aligned 
parallel and 2V_ the number antiparallel to the field, then because 
+ JV_ = N, 


(22 a) 


d* + e- w «“+«*' 


and so M --- N + p — N_p — Np tanh 

In place of the function coth x — (1/a;) we have tanh*. The trend 
is again typical of a saturation curve, but the initial slope is 1 instead 
of 1/3. In what follows we need only interpret L{x) as tanhs;, and the 
result corresponds to the quantum theoretical special case of two 
allowed orientations. 

The interaction of the dipoles may be taken into account by setting 
the effective microscope* field F equal to the sum of the macroscopic 
field and a second quantity proportional to the magnetization; in this 
case, however, the factor of proportionality 4ir/3 derived on p. 454 
does not give a correct representation of the experimental data. 
We therefore insert a general empirical constant v instead. As 
long as we are justified in replacing L(x) by x/3 we have the 
expression 

M = S (H+vM) < 32 ' 


(§) • • < 2te » 


for the magnetization. Solving for M, we obtain 
Np 2 R Np* H 


• • (33) 


where the quantity vNp*/Zk, which has the dimensions of tempera- 
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ture, is denoted by 0. 
Weiss law 


The molar susceptibility then follows the 


M 2 

* ~ 3 R{T - ©)• 


( 34 ) 


If we make a graph of l/% as a function of the temperature, we obtain 
a straight line whose intersection with the axis of absciss® gives the 
characteristic temperature 0. If a positive value is found for ©, there 
must be a real temperature at which the susceptibility attains an 
extremely large value. This is actually the case for a small group of 
substances — the ferromagnetic metals. The characteristic tempera- 
ture above which these substances behave like simple paramagnetic 
substances is called their Curie point. However, for the high values of 



ferromagnetic susceptibilities we are no longer justified in applying 
our approximate formula, but must turn to the Lange vin function. 
We have the following equations for M and H from which F is to be 
eliminated: , 


| M | ^NpLl 


1*1 = 


kT) 


( 35 ) 


( 36 ) 


This may be done graphically as follows: Let the quantity x = pFjkT 
be the abscissa and take y = j M | jNp as ordinate (fig. 1). The value 
of the magnetization corresponding to a given field strength H is the 
ordinate of the point of intersection of the curve y — L(x) with the 
straight hue 

_ kT H 
y '~»Nf X vNp 

The unit along the axis of y is the saturation value Np of the mag- 
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netization. It is remarkable that under certain circumstances we 
obtain, for H — 0, two points of intersection besides 0. These are 
symmetric about 0 and correspond to a high value of the magnetiza- 
tion. This happens when the slope of the bn j is less than that of the 
Langevin curve, i.c. less than 1/3. This again gives exactly the Curie 
point for T. With ferromagnetic substances, a spontaneous mag- 
netization is possible below the Curie point even in the absence of a 
macroscopic field H. Thus we have traced the phenomenon of re- 
manence to the internal field. The high value of the constant v, how- 
ever, remains a puzzle; instead of the value 4it/3 it may be. of the 
order of magnitude of 10,000 for 
ferromagnetic substances. 

Quantum mechanics traces v back 
to “ exchange energies ” (p. 712). 

For some substances, however, this 
may lead to negative values of v. In 
this instance, neighbouring spins set 
themselves in opposite directions an 
a result of the forces between them. 

Such antiferromagnetic materials (e.g. " Coercive 
many compounds of manganese) are force ~ 
paramagnetic at higher tempera- 
tures. With decreasing temperature, 
the paramagnetism decreases in the 
usual way until tti£ Curie point (here 
sometimes called the Neel point) is 
reached, at which time the align- 
ment sets in. This is accompanied 
by a sudden decrease in the para- 
magnetism. 

Even with the origin of the constant v explained, there are still 
many difficulties connected with an understanding of the actual mag- 
netization curve. It is well known that this curve has the form of a 
loop called the hysteresis loop (fig. 2). It follows that very different 
values of M correspond to a given value of E, depending on the pre- 
vious treatment of the sample. One of the first questions arising is: 
At very small field values, why does not the magnetization jump to 
one of the two points of intersection of the curve and straight line in 
fig. 2? Considerations of stability show that, these points represent 
stable states, while tne condition of zero magnetization is an unstable 
one. In reality, the spontaneous magnetization deduced from fig. 1 
corresponds to the condition of saturation attained with the highest 
fields available. The external fields are so small in comparison with the 
internal one that they would produce only tiny displacements of the 
straight lines in fig. I when represented on the correct scale. These 
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externa! fields obviously serve only to overcome certain constraints 
that limit the value of the magnetization, whioii is large even for zero 
field. Inasmuch as these constraints depend strongly on the material 
and on its previous treatment, a great variety of magnetization curves 
is obtained. 

The saturation condition may be changed by lowering the temper- 
ature, which corresponds to rotation of the straight lines. Measure- 
ments of the temperature-dependence of the spontaneous magnetiza- 
tion of iron and nickel are in accord with theory if the L curve in fig. 1 
is represented by tanh*. As will be seen later (p. 667j), together with the 
gyromagnetic anomaly factor 2 discussed in the next section, this 
means that the electrons responsible for ferromagnetism have no 
orbital moment. However, it is not correct to conclude that these ate 
conduction electrons, for in the salts of the entire iron group the mutual 
disturbance of the electron orbits makes these substances behave mag- 
netically as though the electrons were free (see p. 669 et scq.). 

The question raised above, as w why a considerable field is needed 
to produce spontaneous magnetization, is to a great extent answered 
by investigations on single crystals. The curves for various directions 
of magnetization in iron are given in fig. 3. These curves may be under- 
stood on the basis of the following considerations: There are apparently 
preferred directions of magnetization in a crystal. For the cubic lattice 
of iron, for example, this is the [100] direction— that of the edges of 
the cube. No distinction exists between a given direction and its 
opposite. Apparently at zero field there are sizable regions or domains 
oriented in one of the six preferred directions, but because of the com- 
pensating effects of opposite senses no external magnetic effects are 
noticed. When the field i3 applied, the domains snap into the field 
direction to the extent that it is along an edge. Since the directions 
are energetically equal, no expenditure of energy is required and the 
lining-up proceeds very readily as shown by the steepness of the curve 
for this direction. Of course, there may be a potential barrier between 
the two opposite directions, i.e. the lining-up process proceeds along 
directions corresponding to greater energy. As a result, the mag- 
netization curves are not strictly vertical at the start; also, many 
domains remain in their former orientation after removal of the field 
because the thermal energy is not large enough to surmount the poten- 
tial barrier. This- is the way remanence is explained. If the field is in 
a different direction — for instance, that of the cube diagonals [111] — 
all domains line up in the directions of the edges lying nearest the 
field direction (fig. 4). Further increase of the resultant magnetization 
can occur only by the turning of domains away , from the directions of 
the edges, and this requires energy. As a result, the process of mag- 
netization goes easily up to M,j^/ 3, but beyond this point the curve 
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fig. 3. Upon removal of the field the rotated domains will promptly 
go back to the edge directions, which correspond to minimum potential 
energy. This means that the turning process, unlike the reversal pro- 
cess, is free of hysteresis. For fields in the direction of the face diagonal 
the curve begins to bend over at 

For technical polycrystalline materials the actual curve of mag- 
netization is not given simply by superposition of the various crystal 
effects. Rather, an additional circumstance — internal stress — exists, 
and this also determines a preferred direction. The preferential direc- 
tion in a stretched iron wire is that of the tension. In a region of 
uniform internal stress the actual 
magnetization vector orients itself 
in such way that the sum of the field 
energy, .crystal energy and tensile 
energy is a minimum. Since the 
stresses vary from place to place, 
neighbouring domains differ in 




energy and the external field has the additional effect of making in- 
dividual domains grow at the expense of others. Quantitative cal- 
culation of these ideas is quite troublesome and the details will not 
be given here. It is sufficient to note that such considerations have in 
recent years led to an almost complete understanding of what seemed 
at one time to be a hopelessly confused subject. 


5. Magnetism Induced by Revolving Electrons. The Magnetomechanical 
Parallelism. Theory of Diamagnetic Susceptibility. 

If we inquire into the cause of the magnetic moment of an atom, 
basing our considerations on our present concepts of atomic structure, 
we arrive at a vivid interpretation of the “ elementary currents ” which 
Ampere assumed were the origin of atomic magnetism. According to 
our modem point of view, these currents are to be identified with the 
orbital motion of electrons about the nucleus. If the period of re- 
volution of an electron is r sec., its orbit is equivalent to a circular 
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current of strength —ejr. Taking the orbit to be plane and calling 
the enclosed area S, the magnetic moment becomes 


*—\\ < 37 > 

Since the motion of the electron takes place under a central force, the 
Law of Areas {p. 89) is valid, and we may replace S/r by dSldt. Now 
the areal velocity is given in direction and magnitude by the me- 
chanical moment of momentum d divided by 2m; we then have for 
the magnetic moment 



This important relationship is called the magnetomechaniccd paral- 
lel ism. To every magnetic moment there corresponds a mechanical 
moment of momentum whose existence is due to the inertial mass of 
the electrons. This can be experimentally verified. Since the con- 
servation of angular momentum requires this quantity to remain con- 
stant when there are no external forces, a change in the magnetic state 
of a body must impart to it a rotational motion whose angular mo- 
mentum is equal and, opposite to that of the electrons. This pheno- 
menon is known as the Richardson-Einstein-de Haas Effect, and is 
actually observed. The magnetic state of the body may be altered by 
magnetizing it. or by destroying its magnetism by heating it above 
the Curie point, Formula (38) may be checked by observing the values 
of magnetic moment and moment of momentum ; surprisingly enough, 
the factor on the right comes out double the value e/2mc for all ferro- 
magnetic substances; for salts of the rare earths, for which the effect 
is so small that it is just measurable, a value between one and two 
times this factor results. We shall take up this significant anomaly in 
detail later (p. 667 ). 

The inverse effect — magnetisation by rotation — was first observed 
by S. J. Barnett in 1914, shortly before the discovery of the direct 
effect. 

If an atom has several electrons, it is possible for the mechanical 
angular inomentar— and hence the magnetic moments— to mutually 
compensate each other. In such an atom, the field gives rise to a moment 
opposite in direction to the field; its magnitude will be calculated 
.immediately. Naturally, this induced moment appears also in para- 
magnetic atoms, but is small compared with the moment already 
existing, and so may be neglected in general. According to the theorem 
of Larmor mentioned above and derived on p. 664 , the magnetic field 
brings about an added rotation of the entire electron system around 
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an axis through the nucleus parallel to H. The angular velocity of 
this motion is given in direction as well as in magnitude by 



This rotation gives rise to an additional angular momentum 

d — mSjy.v,] — m2 (V,- [tor,-]] = m 2(<o . iq* — r { . r,<a). ' (40) 

The summation is to be extended over all electrons. The radius vector 
of the ith electron, referred to the nucleus, is r t . Take the z-axis of a 
rectangular system of co-ordinates in the direction of the field. Then 
the z-component of this angular momentum, which is the only com- 
ponent having a non-vanishing resultant, becomes 

d t — mat's (r? — r ( z cos 2 #,) = mm2 (aq 2 + yf). . (41) 

Since the field has no aligning effect, all orientations of the atom are, 
on the average, equally probable, and since a : 2 + y 2 + z 2 = r 2 , 

x~? = y} = 2 ? = - 3 - (* 2 ) 


Hence by (38) the average induced magnetic moment is 
s 2 2 r , 2 „ 


and the molar susceptibility is then given by 


As one sees, the negative sign of the diamagnetic susceptibility results 
correctly; in addition, the mean orbital radii computed from (44) 
agree well with the values obtained horn other data. 



CHAPTER XXVI 


Phenomenological Theory of Superconductivity 
1. The Fundamental Equations. 

The superconducting state is characterized chiefly by the sudden 
drop in the electrical resistance of a conductor when it is cooled to 
temperatures near the absolute zero. This effect was discovered by 
H. Kammerlingh Onnes in 1911. The decline from the finite resistance 
corresponding to the temperature of the conductor to the immeasurably 
small value characteristic of this phenomenon takes place in an interval 
of only a few hundredths of a degree. The midpoint of this interval is 
called the critical transition temperature. For pure metals this lies 
between 0-1 and 10° K., while for metallic conducting compounds such 
as hydrides and nitrides of metals it falls somewhat higher. Since new 
superconductors are continually being discovered, it is feasible to state 
only the groups of elements among which no indication of super- 
conductivity has yet been found. These are the elements of the first 
column of the periodic table (the alkali metals and Cu, Ag and Au) 
and the ferromagnetic metals. 

The vanishing of the electrical resistance is not the only charac- 
teristic of superconductivity. A second circumstance, discovered by 
Meissner and Ochsenfeld, may be roughly described as the vanishing 
of the permeability. If a sample which is already in the superconducting 
state is brought into a magnetic field, current will be induced ih the 
surface layers in such manner as to cancel the field within the sample. 
The external magnetic field behaves as if the material had zero per- 
meability. However, if the sample is cooled below the transition 
temperature while in the magnetic field, nothing should occur according 
to Maxwell’s theory. Meissner found, nevertheless, that exactly the 
same field distribution set in as in the first instance. One might expect 
to deduce the behaviour of superconductors by setting o—co and 
fi = 0 in Maxwell’s equations. This does not work. 

Although no satisfactory model has yet been suggested, London and 
la ter von Laue found a way to supplement the Maxwell equations so as 
to represent correctly the phenomena and to describe them in terms of 
a single material constant which depends on the temperature. For the 
present, the Maxwell equations will be written as they stand, with e set 

486 
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equal to unity for metala. Wo retain p, for the permeability at the 
transition temperature, having in mind the fact that certain phenomena 
have not been completely explained, although the most recent results 
indicate that p may always be taken equal to unity. 

The Maxwell equations may now be written 


(1) 


curl E = — 


1 ?? 

c dt' 


• ( 2 ) 


div E — 4 rrp (3) 

div B 0 (4) 

B = pH. (5) 


The new feature is that i and p are each to be broken down into a 
normal and a “ super ” part. The behaviour of superconductors in 
high-frequency fields shows that there is an ohmic contribution to the 
current at temperatures below as well as above the transition tem- 
perature. It appears that only a portion of the electron ensemble 
participates in superconduction* while the remainder behaves in norma) 
fashion!. Accordingly we take 

* ~ *n + V, P — pn + Ps- 


The following equation of continuity holds for each part independently : 

div i n + = 0 , div — 0. ... (6) 

Pt at ' 


As before, we put for the normal current 

t, = oE (7) 

From the fact that for an electron subject to no “ frictional ” re- 
sistance the acceleration will be proportional to the fieid intensity, 
London assumed 

*<*>-» » 


where A represents a new constant of the material which has a value 
dependent upon the temperature. 

For the relation of the super-current and the magnetic induction 
(field intensity), a relation is set up which, in principle, amounts to 
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the application of the considerations of p. 465 to the case of macro- 
scopic circulation of current : 

ccurl (A*,) = --B (9) 


It should again be emphasized that these depictions by means of 
models are meant only to serve as heuristic principles and that it has 
not yet been found possible to incorporate them into a definitive theory. 

To the field equations must also be added the boundary conditions. 
Using the surface density of charge a, the boundary conditions may be 
written * 

Div B = 0 Div *„ + = 0 

Div E = 4ir(<r„ -f a.) Div i, -f ~ ! = 0. 

at 

At the boundary between two superconductors an additional 
boundary condition must be added. From equation (9), because of the 
finite nature of B, this may be written 

Curl (A*,) = 0 or * 2 4’L ^ ■ (11) 



2. Steady Fields. 

Let us first apply these equations to the simplest case, that of steady 
fields, where all time differentials may be set equal to zero. According 
to equation (8), E will then vanish; hence, by (7), i n will also vanish 
and we have i — i s , so that there is a steady super-current. As in the 
ease of a normal conductor, 

curl H = (12) 

However, according to (5) and (9), the magnetic field and the current 
are related by another equation 

curl (A i) = ~Jh (13) 

In a superconductor at constant temperature, A is independent of 
position and we may write 


curl * = - H. (13') 


Div = surface divergence (see p. 44). 
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Taking the euri of (12) and using (13') and remembering that div /xH = 0 
and cl iv * = 0, we get 

curl curl H = — curl t — — H. ... (14) 
c c 2 A ' 

Similarly. 

curl curl z = — ~ curl H = — i. . . (15) 


Imagine a superconductor of infinite extent bounded by the plane 
2 = 0 and filling all space on the positive side of the 2 - axis. Let the 
space where z < 0 be a vacuum where there exists a constant magnetic 
field Hq. By virtue of (10) and because 


divH = ?«. + 3 i + ^ = o 

dx'dy dz 


B. z must be constant in both regions. But EL = const, is not a solution 
of (14) except for = 0. This means that H can have only a tangential 
component, which we take in the y-direction. Thus 
For z < 0, H t = H x = 0; H v = H 0 . 

For z > 0, H t = H x = 0; H v =f(z). 


Because of the continuity of H, and by virtue of (14), we shall have, in 
the superconductor 


„ = Htf-e*, where P • 


(16) 


From (12), the current density is given by 


c_ as ; 

4tt dz 




v = 0 ; 


i„ = 0. 


(17) 


In a superconductor, then, the current as well as the magnetic field 
fall off with the depth. The penetration depth 1/8 is proportional to 
v'A- Independent measurements always lead to an order of magnitude 
ox about 10 -6 cm. for 1/8 for a temperature of 0-5° below the transition 
point. The Meissner effect is thus accounted for. 


3. Optical Behaviour of Superconductors. 

In high-frequency (optical) fields, the ratio of super- to norma) 
current is as follows: Because of the connexion of the two through the 
electrical field intensity E (equations (7) and (8)) we always have 


( 18 ) 
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If i„ is a periodic function of the time, 

i. -= (19) 

then i n — aco Af s , e' m(t+n/2) . ... . (20) 


The normal current leads the super-current by a phase angle jr/2. 
The amplitude ratiojs given by the product ocuA. Taking for the order 
of magnitude of A the depth' of penetration mentioned above, the 
amplitude of the normal current turns out to be a hundred times that 
of the super-current even for the infra-red region. This is in harmony 
with the negative result of all experimental attempts to detect optical 
changes when superconduction sets in. This circumstance compels us to 
assume that a normal current exists in addition to the super-current. 

Although the Laue-London theory embraces a complete picture of 
the phenomena, elaborate computations have not yet succeeded in 
supplying an atomistic theory of the superconductivity constant A 
along lines similar to our interpretation of the material constants cr, e, 
and i u as presented in the previous sections. To this extent, the present 
topic still belongs to the phenomenological part of electricity. How- 
ever, it was presented here for two reasons: Firstly, certain atomistic 
nt tions, such as the acceleration of electrons, were used in setting up 
the field equations; secondly, there is the hope that an addendum 
concerned with an atomistic verification may be forthcoming as the 
subject develops. 



CHAPTER XX VH 

The Electbodynamics of Moving Bodies 


1. Electromagnetic Induction in moving Bodies from the Standpoint of 
the Electron Theory. 

An exact formulation of att electromagnetic phenomena in moving 
bodies is possible only with, the aid of the Theory of Relativity; in 
fact, consideration of these phenomena formed the starting-point of 
this theory. Nevertheless, the Electron Theory is of such great service 
in representing most of the phenomena that, for the sake of sim- 
plicity of deduction, it seems advisable to consider matters from this 
standpoint first, rather than to begin at once to attack these problems 
with the heavy ordnance of the mathematics of the relativity theory. 
This is particularly so in view of the fact that up to the present time 
only a few experiments are known which are sufficiently accurate to 
detect the small differences in the results of the two theories (p. 476). 
The Electron Theory assumes that the electromagnetic field is located 
in a stationary “ ether ”, i.e. in an absolute space pervading all matter, 
while charge, polarization and magnetization are bound to the material 
bodies and share in their motions. In our consideration of relativistic 
mechanics we pointed out that the existence of an all-pervading ether 
is disproved by the results of three experiments whose accuracy is suffi- 
cient to settle the matter. As a consequence, all results of the follow- 
ing calculations are merely approximations, but very useful ones never- 
thelessf If we write the field equations in the form * 


1 SE SP , 

a dt c dt 


- P-V-), 


1 9H 4it 3 M 

curl E 3 - 37 . 

c dt c dt 


• . . • ( 2 ) 


divE=47r(p + — />_), (3) 

div H = 0, (4) 


* The formulation ased here differs from the usual one in which the conduction 
term is simply (4 «/e)pW. The present form is chosen for the reason that, for example, 
in electrolytes there is a conduction current even when the spaoe charge in (3) is 
not different from zero. In that oase, however, equations (1) and (3) written in the 
usual manner would contradict each other. 
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the terms dVjdt, 3M/3 1, p and pv are affected by the motions of material 
bodies. In addition to these equations we have the formula for the 
force exerted by an electromagnetic field on a moving charge e: 

r=eE+-[»H], ...... (5) 

c 

Thus, while a magnetic field exerts no force on a charge at rest, a force 
comes into being as soon as the charge is set in motion. If we look upon 
every force acting on an electrical charge as an electrical force, we 
have an “ induced electrical field strength ” 

E' = i [t>H], (S') 

We now consider a variety of experiments from this point of view. 

(a) Ordinary induction in a moving coil 

The Law of Induction embodied in the second Maxwell equation 
is based upon the assumption that the change in flux consists of a 
time variation of the magnetic field, the path 
of integration being held fixed. Nevertheless, 
the law of induction is also applied without 
further thought to the case in which the flux 
is altered by moving the conductor while the 
field remains constant. With the help of equa- 
tion (4) we shall show that this is really justified. 
If the induced electric field in (5') is integrated 
along a conducting loop there results 

(ji E’ds — - j>[vK]ds. . (6) 

Now it may be shown readily that the right 
member is equal to the rate of change of the 
magnetic flux accompanying the motion of the 
conductor. Let us consider two neighbouring positions of the loop at 
times l and t 4- dt. The change in position may also involve a defor- 
mation. Connect corresponding points of the two contours by displace- 
ment vectors vdt. We thus complete a closed surface over which we 
take the integral of the magnetic field strength. Since the divergence 
of H vanishes in a uniform medium of permeability p., this surface 
integral is zero, by Gauss’s theorem. Now the element of the lateral 
surface (see fig. 1) may be expressed as 

d8f = — [vds]dt, 
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0=-/ HtfSri- f H(iS — A fn[vdsl 

d0> = jHdS— Jnda^ — dtfjvRlds, . . (7) 

f[vH]ds (8) 

In a medium of permeability /x the force, on tile moving charge is p 
times as large, so that we obtain p times the induced potential and the 
integral in (8) represents the flux of induction. Thus it amounts to 
the same thing if the change in time of the induction flux takes place 
through a change of field for a fixed loop, or by the motion of the loop 
in a constant field — a fact tacitly accepted in electrical engineering. 


so that 
Hence 


(b) Dielectric 'polarization induoed in insulators 
H. A. Wilson'performed the experiment of rotating a hollow cylinder 
of a dielectric substance in a magnetic field parallel to the axis. The 
induced electric field acting on the quasi-elastically bound electrons 
of the substance displaces them relative to the nuclei and causes a 
resultant polarization. Neglecting the internal field, this quantity is 


given, by 


A T a[r;H] 


( 9 ) 


where N is the number of atoms per unit volume and a their polariz- 
ability, Using the relationship between Na and K (of. p. 452), we obtain 
If — IfeH] 

4tt c 

In our case P is radial and has the magnitude 
(K - 1 )vH 
?7 re 




(10) 


( 10 ) 


The rotating dielectric has thus assumed a volume polarization and 
is to be compared with a permanently polarized body or “ elected; 
There are free charges on the surface, their surface density a being 
given by the magnitude of P, according to p. 281. The detection of 
these charges is accomplished as follows: the inner and outer sur- 
faces of the cylinder are covered with metal. For the present case it 
is immaterial whether the metai coatings rotate along with the cylinder 
or not, for the magnetic effects due to the moving charges are not 
measured here. If, during the rotation, the two metal coverings are 
connected and earthed, the total flux emanating from the surface of 
the dielectric ends on the coatings, i.e. there is induced upon them 
a charge equal and opposite to the free charge on the surface of the 
dielectric. If we now break the connexion between the coatings and 
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annul the volume polarization of fell© dielectric by stopping the rota- 
tion or cutting oJJ the magnetic field, a surface charge of density a -- P 
remains on the coatings. This charge gives rise to a potential difference 
between the coatings which can be measured by an electrometer. This 
difference of potential depends on the capacity of the cylindrical con- 
denser and of the electrometer. The measurements confirm the pre- 
dictions of the theory. 


(c) W. Wien's experiment 

The displacement of charge (i.e. polarization) generated in atoms 
by motion in a magnetic field may be detected in a particularly favour- 
able optica! case. As discovered by J. Stark, the spectral lines of 
hydrogen are split up int-o numerous components in an electric field. 
This is the so-called Stark effect. Wien caused hydrogen canal rays 
to pass through a magnetic field which was normal to their direction 
of motion. This is equivalent to an electrical field norma] to the plane 
of v and H and of magnitude vHjo. By observing in the direction of 
the magnetic field one actually sees a pattern identical with the Stark 
effect pattern obtained in transverse observation in an electric field. 

2. Magnetic Effects of Moving Charges. 

(a) Roivland ! s experiment 

The next experiment which suggests itself is the testing of the 
equivalence of a mechanically moved charge with an electric current. 
This is accomplished by imparting a large velocity to a charged body 
and detecting the magnetic effects of this artificial rotational current. 
H. A. Howland set a charged, insulated metal plate into rapid rotation 
and obtained the magnetic effect given by the third term of the right 
member of equation (1) (p. 471). 

(6) The experiments of Rontgen and Eickemcald 

Rontgen and Eiehenwald were able to show that the charges 
residing on the surfaces of a polarized dielectric (a= j P i) correspond 
to a current when the body is set into motion. To demonstrate this, 
Eontgeii caused the dielectric between the plates of a charged con- 
denser to rotate; the magnetic effect calculated from the surface 
density of charge and the rotational speed was found to agree with 
the observed value. 

Using another arrangement, the same investigators were able to 
show that the polarization could be altered by mechanical motion, 
this change causes, a magnetic field corresponding to the second term 
of equation (1) (p. 471). In this experiment we again make use of a 
rotating dielectric disc, but the condenser plates are divided “into two 
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halves along a diameter, and adjacent parts are held at opposite 
potentials (fig- 2). Thus the sign of the polarization at any point 
reverses when this portion crosses the 
line of separation, so that a vertical 
“ displacement current ” flows in the 
separating layer. The existence of this 
current was detected by its magnetic 
effects. 

— I + 

lc\ The experiment of Trouton and I 

W Noble 

Trouton and Noble suspended a parallel plate condenser in such 
fanner that it could turn freely about an axis parallel to the plates.* 
The effect to be expected is 
that, as a result of the trans- 
latory motion of the earth, 
the charged condenser will 
turn into a position in which 
the plates are parallel to the 
direction of this motion. In 
order to see that, if we use 
the ether conoept, the motion 
of the earth would cause a 
torque, it is convenient to 
imagine the plate charges & 
concentrated in two small 
spheres whose separation is equal to that of the condenser plates (fig. 3). 
Let the density of charge in these spheres be p. The translatory 
motion of the charge -f-a due to the earth’s motion corresponds to 
a current element ds for which 



Ids = ev. f 


.... ( 11 ) 

According to the Biot-Savart Law, this current element gives rise to 
a magnetic field at the position of the negative charge. This field is 
given by 

_I[dsr]_ c [vr] 
cr 8 ’ 


(12) 


where r is the vector drawn from the positive to the negative charge. 
Thus the field is normal to the plane of v and r. The force with which 

• F. Trouton and T. Noble, Phil. Tram., A, 202, p. 165 (1903); Pros. Pay. Soc., 
72, p. 132 (1903). 

f If we imagine the sphere divided into slices normal to v, each of thiokness ds, 
then each alioe corresponds to a current element pA ds . v. Integration over aii these 
fUoeti yields the volume of the sphere multiplied by pv, wbenoe (11) follows. 
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this field acts on the negative charge, which is also moving with 
velocity v, is given by 

E sS 

c c? r 3 


(13) 


This force, wnioh is normal to the plane of H and v, is of magnitude 
„ eV sin 6 


(14) 


Similarly, an equal and opposite force acts upon the positive charge 
as a result of the field caused by the motion of the negative charge. 
The two forces together furnish a torque parallel to the axis of rotation 
and of magnitude 

M—Fr cos 0 — — ^ sin# cos# 

= ( 15 ) 


This turning moment tends to turn the plates parallel to the “ ether 
wind ”, Now e®/r is the electrostatic energy TJ& of the system. Insert- 
ing this value in equation (15) we obtain the correct formula for the 
parallel plate condenser, except for a factor 1/2, which is due to the 
simplifying assumptions made regarding the distribution of charge. 
The correct formula is 

Af= £7 e ,0ysin20. ..... (16) 

Thus a condenser oriented at an angle rr/4 with the direction of the 
earth’s orbit should be Bubject to a torque U t \{vjcf when charged. 
A moment of this magnitude should be easily observable, yet even 
the most delicate experiments, especially those conducted by Tomaa- 
chek, failed to give any indication of such a torque. The result of the 
Trouton-Noble experiment, which is one of the very few experiments per- 
mitting of -observation of quantities of the second order in v/c, contradicts 
the theory of a stationary ether. 


(d) Variability of the mass of the electron 
The magnetic field of a moving point charge represents a certain 
energy distributed in space, the generation of which requires a certain 
amount of work; i.e. the electron possesses inertia which, as calculation 
shows, increases with increasing velocity. But since the mass is mani- 
fested only by its inertia, this indicates that the inertial mass of an 
electron increases with velocity. The amount of this increase is different 
for acceleration in the direction of motion and normal to this direc- 
tion. We therefore speak of longitudinal mats and transverse mass. 
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According to a calculation made by Abraham, these quantities for a 
rigiA spherical electron are: 

longitudinal mass m t = m 0 ^1 + f . . (17) 

transverse mass m t = «i 0 ^1 ^ — ). . . (18) 

Again, the lowest power of v/c which appears is the second. In addi- 
tion to our previous experiments this phenomenon may also be called 
upon to decide for or against the concept of a stationary ether. As 
mentioned on p. 430, this variability of electron mass has been 
observed for some time. The theory of relativity leads to a simpler 
formula which also contains correction terms in (n/c) 2 and which 
deviates from the above by about twenty per cent. The most exact 
measurements have decided unanimously against the formulas (17) and (18) 
derived from the ether concept, and in favour of the relativistic formula. 

3. Propagation of Electromagnetic Waves in Moving Media. 

(a) Fizeau’s experiment 

The classical experiment on the propagation of light in moving 
media is that of Fizeau * in which light was made to pass through 
flowing water. The small difference between the index of refraction 
of moving and stationary water was measured by an interference 
method. Approaching the question quite naively, one might believe 
that the velocity of the flowing water Bimply would be added to the 
phase velocity as givep by the index of refraction for water at rest. 
It is apparent, however, that this view leads one into difficulties, for 
evidently this convection effect must exist to the full extent even for 
very tenuous moving matter (e.g. in a rapidly flowing gas), while in a 
perfect vacuum there can be no effect. This would mean a discon- 
tinuity contrary to all experience. The actual result of the experiment 
showed that the wave velocity u is increased by the velocity of flow 
v multiplied by the factor (1— l/» a ). The factor (1 — 1/n 2 ) which 
approaches zero when n approaches unity is called Fresnel’s con- 
vection coefficient. This result is obtainable also from the theory of a 
stationary ether, but follows much more simply from the relativity 
theory as a consequence of the addition theorem for velocities (of. p. 244). 

( b ) The Mickekon-Morley experiment 
A largo number of experiments intended to detect the influence 
of the motion of the earth on the propagation of light have been pro- 
posed. For all of these, with one exception — the Michelson-Morley 
experiment — the accuracy of observation is not great enough to detect 
* H. Fizeau, Comptes rettdtu, i3, p. 349 (1851); Ann, d. Phus. u. Chan., Erg., 8. 
p. 457 (1853). 
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deviations of higher order than the first power of vjc. The stationary 
ether theory yields no first-order effect for a common uniform trans- 
lation of source, observer and intervening parts of the apparatus. 
To that extent all these experiments also confirm the theory of a 
stationary ether. This theory, however, does demand a second- order 
effect, and the Michelson-Morley experiment is the only optical ex- 
periment sufficiently sensitive to deteot this effect. This experiment 
has already been discussed in detail on p. 238. The absence of a second- 
order effect here , the negative result of the TroiUon-Noble experiment and 
the departures of the mass variability of the electron from the Abraham 
formula (p. 477) are evidence for the fact that the concept of a stationary 
ether cannot correspond with reality. 

( c ) Doppler effect and aberration of light 
These phenomena have been discussed in detail on p. 245. 

(d) The experiment of Sagnac 

In this experiment the source of light, the observer, and the inter- 
mediate apparatus are given a common rotational motion.* Thus, 
in spite of the fact that all parts 
have a common translational motion, 
there is a rotational effect to be ob- 
served. The experimental arrange- 
ment is shown schematically in fig. 4. 

The semi-silvered plate divides the 
beam into two portions — one travers- 
ing the quadrilateral formed by the 
mirrors in the direction of rotation, 
the other in the opposite direction. 

An interference pattern is formed 
when the beams unite, just as in 
the Michelson-Morley experiment. 

The interference fringes are regis- 
tered on a photographic plate at- 
tached to the apparatus. Fi * -* 

As in the Michelson-Morleyexperi- • 

ment, assume the classical compounding of velocities and compute the 
times required for the light to go in both directions and arrive again at 
G. T aking the axis of rotation as origin, suppose the light travels 
along a curve whose polar equation is r — r(<f>). If, at a given instant, 
an element of the curve makes an angle 0 — cos -^(rdf/ds) with the 
direction of its translational motion arising from the turning of the 

* G. Sagnao, Cornetts rendu#, 157, p. 708 (1013); Journ. de Thysique, 4, Ser. 6, 
p. 177 (1914). 
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whole apparatus, then the relative speed when going around in the 
parne direction as the rotation will be 

v_ = c —<ur cos 0 — o — r*to 


and in the opposite direction 




The difference of the two corresponding time intervals is 
T_- = £ —* - 

c+Ao 'i 

Since the second term in each denominator is small, this becomes 

where S is the area enclosed. Then, by p. 239, the displacement of the 
fringes amounts to 

AZ = ^wS, (19) 


where to is the angular volocity of rotation, S the area of the surface 
enclosed by the light path and A the wave-length of the light. This 
displacement is, in fact, observed. Experiments were performed using 
widely different values of both to and 8. In one trial Sagnao used a 
rapidly rotating diso of about 1 m. diameter; in another trial the 
apparatus was taken on board ship; the angular motion was obtained 
by sailing over a curved course. In recent years Michelson and Gale * 
used the rotation of the earth itself; in order to compensate for the 
small value of to, a light path several kilometres in length was used. 
Technically, this experiment differs from that of Sagn%c in that the 
zero position of the fringes cannot be determined by stopping the 
rotation; hence it was necessary to have a second light path, enclosing 
a negligible area, so arranged as to give fringes serving as a reference 
mark. Further, unless the circuit is at one of the poles, the angular 
velocity vector is not normal to the plane of the circuit, and only the 
normal component is effective. The result of this experiment was in 
complete accord with the calculations. The fact that the stationary 
ether concept leads to correct results for rotating systems in optical 
experiments also is no more a proof of the existence of the ether than 
are the mechanical phenomena of the Coriolis and centrifugal forces 
(cf. p. 233). The Michelson-Gale experiment is the optical analogue of 
* A. A. Miohelson and H. G. Gate, Asbrophya. Joum., 61, p. 140 ( 1 925). 
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the Foucault pendulum. It is of importance from the fact that it makes 
all attempts to explain the negative result of the Michelson-Morley 
experiment by convection of the ether seem futile, for it seems absurd 
that the ether could be carried along by the earth completely in trans- 
lation and not at all in rotation. As explained on p. 258, the above- 
mentioned mechanical experiments are explicable under the generalized 
theory of relativity without using the concept of an ether at rest, 
and the same is true of the Sagnac experiment. 

4. # Relativistic-invariant Form of the Electromagnetic Equations. 

The mathematical formulation of the theory of a stationary ether 
involves the application of formulae (1) to (5) (p. 471), with the 
derivatives d/dt ... for the moving bodies formed according to the 
Galilean transformation; i.e. we equate the total time rate of change 
of a quantity « to the sum of the pure time rate dujdt for a fixed point 
and the spatial change V grad w due to displacement of this point, as 
in hydromechanics. But, as already indicated, this theory leads to 
conflicts with experience for effects of second order. Small as are the 
differences between the theoretical and experimental results, they 
nevertheless compel us to abandon the theory. Now it is very re- 
markable that, as will be shown below, the equations of the electron 
theory may be set down at once as equations connecting world vectors, 
which means that they are invariant under the Lorentz transformation 
and not under the Galilean. One might at first believe this to be a 
matter of course, since the Lorentz transformation was introduced in 
such a way that the wave equation remained invariant under it. It is 
to be noted, howqjper, that while the wave equation is a consequence 
of the electromagnetic field equations, the electromagnetic equations 
are not, reciprocally, consequences of the wave equation, since the 
latter also occurs elsewhere, e.g. in the Theory of Elasticity. Again, 
the invariance of the electromagnetic equations under the Lorentz 
transformation — which is, as a matter of fact,' identical with the 
Galilean to a first approximation — shows that the stationary ether 
theory also fails to indicate first order efEects when all parts of an electro- 
magnetic system share a common motion. This fact was stated in the 
preceding section, but without proof. 

In order to obtain a simple form of equations (1) to (4) (p. 471), 
we restrict our considerations to weakly magnetic substances, so that 
the term 9M/3f in equation (2) may be omitted. In addition, let us 
combine the displacement current dPjdt with the conduction current, 
i.e. let the terms {^Trjc)p i v { consist of the sum of a convection current 
of free charges and a part due to the displacement of the bound charges 
from their equilibrium positions. The term dPjdt then appears to be 
absent, but actually pv includes feqth the conduction current and the 
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contribution of the displacement. Writing Ijic in place of t, the co- 
ordinate forms of equations (1) and (3) become 

dH M dH v diE x in, . i 

Hy dz W = ~c {p+V+ “~ p - V -*> 


d H„ 
dx 


^ dz 

3 H x 
Sy 

, 3 iE x . diE y , diE a 


' nr = 


in 


(P+V+ V — P-V-y) 


3 Hy 


diE a in . . 

~ nr ^ r {p+v+ ’- p - v ~-' ] 


„ OXJOly . OlE/y 

~ dx 3 y dz 


- ±-*i(p+ — P-) 


. ( 20 ) 


while equations (2) and (4) are 

diE, diE v 

dy dz 


3 H x _ 
' W~ 


diE . 

3t B, 

dHy 
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nr 

di Ey 

diE, 

dH , 
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nr 

dHy, 

dHy 

dHy 

-3F + 


37 


( 21 ) 


We proceed to show that these are the component forms of two equa- 
tions between world vectors, or tensors. First we investigate how the 
density of charge p changes under the Lorentz transformation. Start 
from the assumption that the charge e = pAr is independent of the 
state of motion, and so does not* change in this transf ormatio n. On the 
other hand, all volumes are reduced in the ratio yl — fP : 1 by the 
Lorentz contraction; hence to a moving observe r the charge density 
p appears to be increased in the ratio l/i/l— /P : 1. Denoting by p 0 
the density of charge as measured by an observer moving with it, we 
have 

p = or pVl — p 0 = invariant. (22) 


According to p. 266, the components of the four-velocity q T are 


VI- 


Vi - p' Vi - 


VT=!p’ 


If we multiply each of these expressions by the invariant of (22) we 
obtain the components of a new world vector which we call the 

17 (b 7X1) 
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four -current s r ; the positive charges yield the f our-cur rent s r + — the 
negative, the four-current s r _. Since the factor y'l — J3 2 cancels out in 
the multiplication, the components are the right members of equations, 
(20), apart from a factor 4w/c. Consequently, the left members must 
also be the components of a world vector. This is derived from a world 
tensor by an operation which is independent of the system of co- 
ordinates. We aready applied such an operation in three dimensions 
in the Theory of Elasticity without introducing any special designation 
for it. If, in the three-dimensional case, say, we have a tensor in the 

form T = a x . i + a % .j -f- a 3 . k, 

and we use this tensor as a postfactor in multiplying with the nabla 
operator, we obtain 

VT = . * + Va 2 J + Va z .k 

— div a^.i div a z .j + div a 3 : ft, 

i.e. a vector whose components are div a lt div a 2 , div a s . We shall refer 
to this operation as the formation of the divergence vector of the tensor 
T, and shall designate the operation by 

VT = divT (23) 

The extension to the four-dimensional case involves nothing new. If 
we call the world tensor whose components are 


0 

B. 

-Hy 

-iE t 


0 

H a 

~iE t 

H * . 

-E x 

Q 

-iE, 

iE x ~ 

iE v 

iE, 

0 


the electromagnetic field tensor <h r , we see that the components of the 
world vector of equation (20) are obtained from this world tensor 4> 
precisely by the operation given above. Hence equation (20) may be 
written . 

div r 0> r = ^ {s\ + S‘-_) (24) 

But according to equation (65) (p. 251 ), the system (21) above states 
that the antisymmetric world tensor *3> r is derived from a world vector 
potential V r by taking the curl in four dimensions: 

<3> r = curl r V r . (25) 

The formulation of the laws of electromagnetism given by (24) and 
(25) seems, at first, somewhat colourless; nevertheless, compared with 
the usual three-dimensional vector equations, it has the great advan- 
tage that it combines the electric and magnetic fields so as to produce 
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a single unifi ed field specified by the tensor <E> r , the resolution into 
electric and magnetic components depending upon the state of motion 
of the observer. A field which appears to an observer at rest to be 
purely magnetic may, for a moving observer, appear to have an elec- 
trical component. Moreover, the change from one system to another 
is reduced to the purely formal operation of transforming four-dimen- 
sional tensor components. For example, in the experiment of W. Wien 
(p. 474), the world tensor <E> r has the following components for the 


system in which the apparatus is at rest: 


0 

H 

0 

0 

-H 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0. 


On the other hand, consider a canal ray particle moving in the direc- 
tion of the a;-axis with velocity v. Since 

V = l cos <f> + x sm</>, 
x' = — l sin^ + x cos <f>, 
tan<£ = —ij 3, 


the rule for transforming of tensor 

components (pp. 34, 249) yields 

0 

H cos<f) 

0 

0 

—Hcoa<f> 

0 

0 

— H sin<£ 

0 

0 

0 

0 

0 

H sin^ 

0 

0 


We see that an electrical component has made its appearance, viz.: 
iEJ = sin^ = - - ~L %. or EJ = 

’ VI -P VI -p 


which is identical with that derived from equation (4), p. 471, as far 
as quantities of second order. Likewise, the magnetic component 
remains unchanged as far as second order quantities. 

Ex. 117. Derive the Fresnel coefficient from the relativistio velocity addition 
theorem. 



PART V 

The Theory or Heat. Phenomenological Part 


The Theory of Heat, like the Theory of Electricity, may be developed from 
two different points of view. The phenomenological method employs only oon- 
oepts like temperature, quantity of heat, &o., which are taken from the macro- 
scopic world of observations and which can be measured directly. The laws thus 
obtained have the advantage of being free from hypothetical assumptions. On 
the other hand, in the Theory of Heat, e.g. in connexion with the Law of Entropy, 
we feel the need of a deeper “ explanation ”. Such an interpretation, deeper 
because more vivid, is furnished by the atomistic, statistical view. Here again, 
as in the electrical case, we find that although we develop the theory separately 
from the two sides, it will often prove advantageous, while we follow (me line of 
approach, not to shut our eyes completely to the other. 



CHAPTER XXVIII 

Theory of the Conduction of Heat 

1. Definition of Temperature, Quantity of Heat, Thermal Capacity and 
Specific Heat. 

The sensory organs of the skin permit us, within, certain limits, 
to perceive temperature. In particular, we can decide readily which 
of two objects is the warmer. However, this qualitative measure of 
temperature is not sufficient for physical measurements ancl we must 
seek a phenomenon which is causally connected in some way with 
the temperature of a body. The increase in volume of a substance, 
e.g. mercury, with rising temperature is a phenomenon of this kind. 
For convenience of observation of the expansion, the mercury is con- 
tained in a bulb and allowed to expand into a fine capillary tube at- 
tached to it. A thermometer of this kind is calibrated by immers- 
ing the bulb in melting ice and marking the position of the mercury 
filament, then immersing the bulb in the steam rising from water 
boiling at standard atmospheric pressure and again marking the 
position of the thread of mercury. The interval is then divided into 
one hundred equal parts. We have thus obtained a provisional scale 
of temperature. If, now, we repeat this process with another ther- 
mometric substance, we find that accurate observation reveals a diffi- 
culty; vis. since the volume is not strictly a linear function of tem- 
perature, two thermometers filled with different substances and cali- 
brated as above will not agree exactly at intermediate points when 
immersed in the same substance. We might of course avoid this diffi- 
culty by the arbitrary choice of a standard thermometric substance, 
say mercury. There is, however, a better plan: we obtain almost 
perfect agreement by using gases which are as far as possible from 
their, boiling-points. For practical reasons a gas thermometer is usually 
arranged so that we measure not the expansion but rather the increase 
in pressure occurring when the volume is held constant. Since indi- 
vidual differences arise when such gases are in the neighbourhood of 
their boiling temperatures, we choose as a thermometric substance 
that gas which, under ordinary conditions, is farthest from itB boiling- 
point, viz. helium. Thus accurate thermometric measurements are based 
upon a temperature scale determined by the increase in pressure of a con- 
tsv 
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fined volume of helium. Moreover, the gas thermometer defines a natural 
zero point for our scale — the so-called absolute zero — which is at — 273° 
on the Centigrade scale defined above. The determination of this zero 
point will be discussed later. The temperature scale defined on this 
basis departs but little from one defined on purely theoretical grounds, 
the thermodynamic scale, which wifi be explained later. We shall 
denote temperatures on the Centigrade scale by t, those on the absolute 
scale by T. There will be no danger of confusing t with the time, since 
pure thermodynamics deals only with equilibrium. In the Theory of 
Heat Conduction, however, we shall denote the temperature by 6. 
Since, in the latter case, we are concerned only with temperature 
differences, it is immaterial whether we understand this to be Centi- 
grade or absolute temperature. 

In addition to temperature there is a second important concept — 
that of quantity of heat — which is given by ordinary experience. If 
we drop a 1 gm. mass of iron at 100° C. into 10 gm. of water at 10° C. 
and, in a second experiment, drop a 10 gm. piece of iron at 100° C. 
into the same amount of water at 10° C. as before, we observe, 
after the systems have come to equilibrium, that the rise in tempera- 
ture of the water is about nine times as great in the second case as in 
the first. This leads to the idea that a body possesses a given heat 
content which is equal to the product of the temperature by a quantity 
proportional to the mass. This latter quantity is called the heat 
capacity. It is to be noticed that we deal only with differences in quan- 
tities of heat, so that the absolute value is undetermined. The pro- 
cess of coining to equilibrium in the above example may then be 
interpreted as the transfer of a given quantity of heat from one body 
to another. Without forming special conceptions of the nature of 
heat, we postulate a conservation of quantity of heat for such processes. 
As a unit of quantity of heat, we choose the amount of heat necessary 
to raise the temperature of one gramme of water from 14*5° C. to 
15*5° C., and call this a small calorie (sometimes “ gramme calorie ”). 
There is a larger unit, the large calorie (or kilogram calorie) which is one 
thousand times as great. 

Experimenting further, we find that 1 gm. of aluminium at 100° C., 
when placed in 10 gm. of water at 10° C. imparts to it a greater rise in 
temperature than does 1 gm. of lead at 100° C. Hence we set the heat 
capacity equal to the product of the mass by a constant of the material. 
This constant is taken to be unity for water at 15° C. The quantity 
of heat in calories required to raise the temperature of 1 gramme of a 
substance 1° C. is called the specific heat of the substance, preferably 
called the specific heat per gramme. There is another quantity which 
is important — the specific heat per mol, i.e. the thermal capacity of one 
mol of a substance. The advantage in using this molecular heat is that 
in comparing the molecular heats of various substances, the same 
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numbers of molecules are involved. We stall denote the thermal 
capacity by C, the specific heat per gramme by c, and the molecular 
heat by c. In order to determine the equilibrium temperatures in the 
above examples we need only equate the quantities of heat involved, 
before and after mixing. In the example Of 1 gm. of iron at 100° C. 
placed in 10 gm. of water at 10° C., the end temperature a; is obtained 
frojn the relationship 

I . c, . 100 + 10 . 10 = (c, + 10)*, 

where c, is the specific heat per gramme of the iron.* 

In spite,of this simple applicability of the concept of heat content, 
it must be mentioned that these simple relationships exist only when 
the bodies under consideration do not undergo other changes. In 
general, the heat capacity is by no means a function of the temperature 
alone (ef. p. 502). 

2. The Differential Equation of Heat Conduction; Initial and Boundary 
Conditions. 

Common experience shows that any temperature difference be- 
tween parts of a body reduces to zero after a time, so that heat must 
flow from the places of higher temperature to those of lower tem- 
perature. It is natural to assume that the density of the thermal 
current, i.e. the quantity of heat passing per second through unit area 
of a surface normal to the direction of flow, is proportional to the 
negative of the temperature gradient. This assumption is confirmed by 
experiment. The factor of proportionality Te is called the thermal 
conductivity of the substance. If now we imagine a certain volume of 
a body isolated, the rate of decrease of the heat content is 

dW 3 r Q , 

~W == ~di J p0c ''•**’ (1) 

where 0 is the temperature and p the density. This decrease is due to 
conduction, the heat flowing away through the bounding surface. The 
flow is given by 

- — = -ft/gradfldS (2) 

Equating the two, we have 

~ jpOcgdr— k£ grad 9d8 — kj div grad 0dr. . (3) 

• This is equivalent to equating the heat given up by the hot body (or bodies) to 
that absorbed by the cold, in which case the heat equation is 

1 . c i,(100 - *) - 10(* - 10), 

which is, of eourse, the same as the equation given above, _ 



HEAT. PHENOMENOLOGICAL PART 


490 


[Chap. 


Since this must hold for every element of volume, we have the equation 
of heat conduction 


— A0=mA0 < . (4) 

dt pc, ' ' 


The constant m = k[pc, was named the diffusivity by Kelvin.* 

* As in the case of all partial differential equations, the solution of 
the problem is not defined unless initial and boundary conditions are 
given. Thus there must be given a definite distribution of temperature 
at time t = 0, and also information as to what is happening at the 
boundaries of the body. The surface of the body, for example, may 
be completely insulated. In general this-«ejll not be the case; as a rule, 
heat will pass outwards through the surface. It is found that the 
surface density of the flow of heat normally outwards through the 
surface is approximately proportional to the difference in temperature 
between the surface of the body and the surrounding medium. 
The factor of proportionality is called the surface conductivity , or 
emissivity. 


3. A Simple Example of the Integration of the Equation of Heat Con- 
duction: Penetration of the Daily and Yearly Temperature 
Variations into the Interior of the Earth. 


Take the surface of the earth to be plane and assume the surface 
to undergo periodic temperature variations as a result of the fluctuating 
radiation to which it is exposed. The boundary conditions here are 
especially simple, for there is no heat flow through the lateral surface 
of a cylinder whose generators arc vertical. This is because the iso- 
thermal surfaces are planes parallel to the earth’s surface, as may be 
seen intuitively; hence the entire flow is normal to the surface of the 
earth. Taking the z-axis along the inward normal to this surface, the 
differential equation becomes 


00 

dt 


0 S 0 

W 0F 


(O 


Since 8 at the surface is a periodic function of the time, we take 
8=f(x)e?»‘ (5) 


Substituting this value in (4') we obtain the following differential 
equation for /(*): 


A- 


iwf— 0, 


(6) 


* Clerk-Maxwell called this quantity the thcrmometric conductivity, but Kelvin's 
terminology is more widely used. 
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which we seek to solve by setting / = ac 1 *. The “ characteristic 


equation ” becomes 


m\ 2 — — 0 , 


(7) 


whence 

±J^=± + O (8) 

V m V 2m 


The negative sign is the one to be selected, for if we limit our inquiry 
to the penetration of the surface variations, there must be a decrease 
toward the interior. Then 

B == e»“tt-(*/V2mu)3 ( 9 ) 

This i s the equation of a damped wave propagated with velocity 
0 = \Z2moj. This shows that the velocity is greater for the diurnal 
variation than for the annual, but at the same time the damping is 
greater for the former. 

It is also easy to fulfil the initial time conditions for x — 0. Since 
the real as well as the imaginary part of (9) represents a solution, we 
may obtain the solution for any values of 9 0 and (dO/dx) 0 from the 
general solution containing two constants of integration: 

6 = Ae~ V, “ /2m * COSto it \ 

\ VZmw/ 

+ Be-^-‘ . . . ( 10 ) 

if, for example, .we begin counting time from a moment when 
6 at x = 0 is a maximum, we have the solution 

6 — B mtI e~ v “ l2mx coscc^t — . . (10') 

where B and are, of course, the differences reckoned from the average 
temperature. 



CHAPTER XXIX 

The Equation of State of Thermodynamic Systems 

1. Definition of the Thermodynamic Variables and the Relationships 
between them. 

In thermodynamics, the treatment of which we begin here, the 
word “ system ” is used as the most general and neutral expression for 
an aggregation of bodies considered as a whole and assumed to be 
isolated from their surroundings. A system may consist of several 
distinct physical parts called phases. For example, if we have a 
saturated NaCl solution containing an undissolved crystal of NaCl, 
the whole being covered by a bell jar, then we have a system of three 
phases: a solid phase (the crystal), a liquid phase (the solution) and a 
gaseous phase consisting, in this instance, of a mixture of air 
and water vapour. Since all gases are miscible, a system in equili- 
brium can have but one gaseous phase, but may have several solid or 
liquid phases. For example, we can introduce an additional solid 
phase by placing a crystal of quartz in the solution. The thermo- 
dynamic behaviour of the system is dependent upon the number of 
“ independent components ” or “ substances ”. We understand this 
to mean the number of freely variable chemical compounds or elements, 
contained in the system. To obtain this number, determine first the 
number of elements present (combined or free) and subtract from this 
the number of elements defined chemically in terms of the remaining 
(independent) elements. For example, water and ice constitute a two- 
phase system of one component, since the quantity of oxygen present is 
determined by the quantity of hydrogen. On the other hand, a solution 
of NaCl in water is a single-phase system of two components. The 
state of a system is determined, in the thermodynamic sense, by giving 
the mass and chemical constitution of each phase as well as the pres- 
sure, volume and temperature. These quantities, however, are not 
independent of one another, but are connected by a relationship called 
an equation of state. 

We limit our present considerations to a single-phase system whose 
chemical constitution is invariable, e.g. a system consisting of 10 gm. 
of liquid benzene. Since the mass and chemical constitution are fixed, 
there remain only the three quantities temperature t (or T), pressure 
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p and volume V as true thermodynamic variables. Temperature and 
methods of measuring it have already been discussed in Chap. XXVIII 
(p. 487). The volume is measured in cubic centimetres and the pressure 
in dynes per square centimetre. In some instances the pressure is 
measured in atmospheres, this being the pressure capable of support- 
ing a 76 cm. column of mercury at 0° C. 

1 Atm. = 76 X 13-6 x 981 = 1-013 X 10“ dynes/cm.* = 1-013 x 10 6 newton/m.* 


Now there is generally an equation of state 

F(p,t,V)= 0, (1) 


according to which the condition of a simple system is determined 
by fixing the values of two of the variables. ^ An equation of state 
which is valid over large ranges can be given only for gases. For liquids 
and solids these equations are more like interpolation formulae holding 
over a limited range. But, as we shall see presently, important con- 
clusions may be drawn from the mere fact that an equation of state 
exists. 

We now consider those changes of state in which each variable in 
turn is held fixed. We thus arrive at definitions for three coefficients, 
viz. the following: *“ 

The isobaric volume coefficient of expansion a is defined as the rate of 
change of volume per degree divided by the volume at 0° C., the pres- 
sure being held constant, i.e. 


V 0 A t 


P(p, 0 _ 


. ( 2 ) 


The subscript attached to the partial derivative indicates the quantity 
which is held constant. 

The isometric (or isochoric) pressure coefficient j8 is defined as the 
rate of pressure change per degree divided by the pressure at 0° C., 
the volume remaining constant: . 

/? = LlimPg^ ± M zi£(Zd) = l/gF\ . . ( 3 ) 

Po A t Po\ 2i /r 

Finally, the isothermal compressibility k is defined as the rate of 
change in volume per unit pressure (or per atmosphere) divided by 
the existing volume, the temperature being held-constant: 

1 V(t,p + Ap)-V(t,p ) __l/dV\ 

V A p ~ V\dp)t w 

Since the volume decreases with increasing pressure we insert the nega- 
tive sign to makes: positive. The reciprocal of k is called the isothermal 
volume elasticity E t . It is to be noted that, for reasons of convenience. 
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the coefficients are so defined that o and are referred to the volume 
or pressure at 0° C., while k is referred to the existing volume. The 
advantage of this convention becomes apparent in the case of gases. 
For liquids and solids it is practically immaterial whether the change 
is referred to the volume at 0° C. or the volume at the existing tem- 
perature. Again, it is immaterial whether we write t or T in defining 
a and /}, since temperature differences only are concerned. The exis- 
tence of an equation of state means that the three coefficients are not 
independent. If, for example, we imagine the equation solved for p, 
then 

p = p(F,t) 

“ d +(&),*■ < 6 > 

If p is held constant we have* after division by dt, 

Inserting the coefficients defined above in place of the derivatives, 



Ex. 118. A mercury thermometer whose capillary is just filled at 46° C. is 
warmed to 50°. What is the pressure in the capillary if the coefficient of expan- 
sion a = 18 X 10“« deg. -1 and the compressibility * = 39 X 10“ 7 Atm. -l f 

2. The Equation of State of an Ideal Gas. 

The following empirical relationships have been found to hold for 
gases far removed from their point of liquefaction: 

1. The Law of Boyle and Mariotte . — At constant temperature the 
volume of a gas is inversely proportional to the pressure, i.e. the pro- 
duct of pressure by volume depends only upon the temperature: 

pV(p, t ) = p 0 V(p 0 , t) (7) 

where p 0 denotes some standard pressure, say 1 atm. 

2. The Law of Gay-Lussac . — The isobaric volume coefficient has 
the same value a = 1/273 *1, independent of the temperature, for all 
gases. Thus by the defining equation (2) (p. 493), 


V(p, t) = V(p, 0) (1 + at) (8) 

and in particular 

V(p 0 ,t)=V(p 0 ,0)(l + at) (8') 

Combining (7) with (8') we obtain 

pV(p, t) = p 0 (l + ai) V(jp 0 , 0) (9) 
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We now change the zero of the temperature scale from the temperature 
of melting ice to the temperature —1/a = -273-1° C., and take 


pv - 1)r -fg; r ■=-»'. . . CD 

We have already referred to this scale of temperature as the absolute 
scale. The constant A, which is equal to 1/273-1 of the product of the 
pressure and volume of the gas at 0° C., varies according to the mass 
and nature of the gas under consideration. However, we obtain a 
universal gas equation by using another fact of experience as follows. 

3 . The Law of Avogadro , — The densities of two gases which are at 
fche same pressure and temperature are proportional to the molecular 
weights. If M is the mass of the gas, the density is M/V and so 


where ni v nu are the molecular weights. Equation (12) is equivalent to 

an 

The quantity 

mV 

M~ V 

is the volume occupied by one mol. This, as well as all symbols referred 
to one mol (gramme molecule), will be denoted by small letters. 

If we write the gas equation for one mol {F — »), the constant on 
the right becomes the same for all gases, since by Avogadro’s Law, 
one mol of any gas occupies a given volume v g at a given temperature 
and pressure. We then have 


The numerical value of this so-called universal gas constant is 
R — 8-314 X 10 7 erg/deg. mol — 8310 int. joule/deg. kg. -mol. 

Since the mass of one mol is m grammes, where m is the molecular 
weight, multiplication by Mhn yields the following result for any mass 
M of gas M 

P V = -RF^nRT, ...... 


where n is the number of mols of gas present. 


(13') 
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The produot pv vanishes for T = 0. This means that an “ ideal ” 
gas which obeys (13) would have zero volume for a finite pressure, or 
would no longer exert pressure, the volume still being finite. The 
former alternative seems impossible to visualize. Now the atomistic 
view reveals at once the nature of the simplification inherent in the 
definition of an ideal gas which leads to such paradoxical results. 
According to the kinetic theory of gases, the pressure exerted by a 
gas on the walls of its container is due to the innumerable impacts of 
the rapidly-moving molecules. The molecules themselves may be 
assumed to be particles having no extension, so that the entire volume 
v is available to every molecule. The result is that at absolute zero, 
where the molecules no longer are in motion, the volume of the gas 
is zero. In reality we know, however, that the colliding molecules 
behave like elastic spheres whose diameter is of the order of 10 -8 cm., 
so that they occupy a small, but nevertheless finite, volume. Further, 
in deriving the gas equation from the kinetic theory, it is assumed 
that the molecules exert no forces on each other. We know, however, 
that large intermolecular forces of attraction exist in liquids and solids 
(cohesive forces) and it is thus probable that such forces exist in the 
gaseous state also, but that they are much feebler, corresponding to 
the greater average distance between molecules. 

We may now derive the equation of state of a mixture of ideal gases. 
Let the masses M x , M 2 , M z , ... of gases having molecular weights 
m 2 , ... be contained in a volume F. In order to obtain the 
equation of state for this mixture we need the following law. 

4. Dalton’ a Law of Partial Pressures. — The pressure exerted by a 
mixture of gases is equal to the sum of the partial pressures which each 
gas itself would exert if it filled the entire volume. From the kinetic 
theory standpoint this law is a matter of course, since the total effect 
of the collisions with the walls must be equal to the sum of the effects 
due to the separate kinds of molecules. Thus for the first component gas, 

Pl F = ^BT=n 1 BT, 

and similarly, 

p 2 F = ^2 RT = t^RT. 
m% 


Adding all such equations and applying Dalton’s Law, we obtain 

f>F= + . . . W= (Ml + «*+ n 3 + • • .)RT—nRT (14) 

V*i / 


»1 + «2 + n 3 + ■ • • 


Using this relationship we can define the apparent molecular weight 
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of the mixture as the molecular weight m of a gas which, for the same 
mnJg would satisfy the same equation of state, viz. the equation 


p v = - RT = RT. . (15) 

m m 


Comparison of (14) and (15) gives the apparent molecular weight of 
the mixture as 


- M x + M 2 + M s + ... 

Wj m 2 m 3 


(16) 


3, The Equation of State of a Real Gas. 

The atomistic view shows us how to improve the equation of state 
of a gas: it is necessary only to take account of the quantities which 
were neglected in arriving at the equation of an ideal gas. Firstly, we 
must subtract from the macroscopic volume v a quantity b which 
allows for the space occupied by the molecules. This quantity b is four 
times the volume actually occupied by the molecules, which are assumed 
to be spherical. The reason for this is that in a collision the centres of 
two molecules approach no nearer than a diameter, so that for impact 
we may imagine half the molecules to be mere points, while the re- 
mainder occupy eight times their actual volume. Hence, for the 
average, we use four times the true volume of the molecules. 

The cohesive forces operating on any given molecule in the interior 
of the gas cancel out, since they are essentially equal in all directions; 
not so, however, for a molecule near a wall of the vessel. It will be 
drawn back somewhat into the interior of the gas, if we assume no 
attraction between the gas molecules and the wall (adhesion). This 
has the effect of a small inward pressure in addition to the observed 
manometric pressure p. This added pressure must be inversely pro- 
portional to the square of the volume, for the force acting on a given 
colliding molecule is proportional to the number of molecules in the 
interior, which, in turn, is proportional to the density of the gas. But 
the pressure increase is also proportional to the number of molecules 
colliding with the walls, and hence again proportional to the density. 
Thus, in place of Boyle’s Law we write the following equation for one 
mol of gas: 

(p + £)(»-*>)= (Po+fy^o-b)- - • ( 17 ) 

The product is again set proportional to the absolute temperature, 
but the constant R is to be replaced by gas constants R' which differ 
somewhat for various gases. The resulting equation of state is named 
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after van der Waals, to whom most of the above considerations are due: 

(l> + £)(v-b) = B?T. (18) 

In order to obtain a view of the relationships embodied in this formula 
we draw the lines T — const, in the up-plane (fig. 1). These lines are 
called isothermal lines, or simply isotherms. The isotherms of an ideal 
gas are the rectangular hyperbolas pv = const. The greater the value 
of T, the less important are the correction terms in van der Waals’ 
equation. This means that the isotherms for higher values of T are 
not sensibly different from the hyperbolas of an ideal gas. For small 



values of T, on the other hand, the isotherms are of a different type, 
each one having a maximum and a minimum of p. The transition 
between the two types is a curve having a point of inflection with a 
horizontal tangent at which both extrema coincide. If, now, the 
point representing the state of a gas moves inward from large values 
of v along a low isotherm, the pressure increases as the volume is 
decreased. This is necessary in order to reach equilibrium, since we 
may imagine the volume to be decreased by loading a piston which 
confines the gas in a cylinder. After we reach the maximum of the 
curve, however, the pressure begins to fall as the volume is further 
decreased. There is an unstable region from A onward, since the 
internal pressure no longer increases if we seek to decrease the volume 
by loading the piston. Actually, this portion of the curve is not tra- 
versed at all; part* of the gas begins to liquefy at B, and the pressure 
remains constant as the volume is further decreased. In the meantime, 
more and more gas is liquefied, until it is all in the liquid state at 0, 
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aiMt the remaining branch of the curve gives the connexion between 
volume and pressure of the liquid. We shall find later that the hori- 
zontal line is to be so drawn that the two shaded areas are equal (cf. 
Ex. 124, p. 524). 

Curves having no extrema can have no regions of this kind; thus 
is explained the fact that above a certain critical temperature it is im- 
possible to liquefy a gas, no matter how great the applied pressure. 
The corresponding curve is the one having the point of inflection. 
The co-ordinates T k , p k , v k of this point are called the critical tem- 
perature, critical pressure and critical volume. The following equations 
hold for these quantities: 

(19) 

and since the tangent is horizontal, 

\3«A («* ~ b ) 

Since there is a point of inflection, 

2R'T k 


+ ,Ts- 


6a 


vwr 


’ («* - t>) z v k *' 


( 20 ) 


( 21 ) 


These equations are readily solved for the critical values; for 
example, division of (20) by (21) yields 


v k — 36, 

whence, from (20), 

T _ 8a 


( 22 ) 

(23) 


and from (19), 


(24) 


Combining the last three equations gives the relationship 

P*»k _ 3 /0KS 

WT k ~ 8 • ‘ * (25) 


for the oritical point. For an ideal ga3, the expression pv/RT is equal 
to unity. 

It is also possible to express the three constants of the van der 
Waals equation in terms of the critical constants: 

b ~% a=3p ik iv i j 

«•=!» j 


( 26 ) 
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The equation of state then becomes 

('+W-*)-! 1 # ■ • • (27) 

If we now introduce the following “ reduced variables 



we obtain an equation free of the constants of individual gases: 

(n+ i)(3c-l)=8T (29) 

The existence of a reduced equation of state depends solely upon 
our being able to represent the facts by a formula containing three 
constants* This is possible only to a limited extent, since many sub- 
stances undergo chemical changes as the density increases (e.g. as- 
sociation). Such changes are not accounted for by the van der Waals 
formula or by any other three-constant equation, since they are entirely 
of an individua 1 nature. The existence of a general equation of the 
type (29) is known as the Theorem of Corresponding States. While 
the above considerations show that this theorem may not be valid in 
some cases, it may usually be employed advantageously in dealing 
with chemically similar substances. 

Ex. 119. Derive expressions for the compressibility (or for the volume modu- 
lus of elasticity) of an ideal gas and of a van der Waals gas. 

* A reduced equation of state is still obtained if an individual gas constant is used 
in place of the universal constant B. The former, too, is eliminated in obtaining the 
reduced equation. 



CHAPTER XXX 


The First Law of Thermodynamics: 

The Conservation of Energy 

1. Formulation of the First Law. 

The First Law makes a statement concerning the nature of heat. 
It asserts that heat is a form of energy, and that in a closed system the 
total store of energy, i.e. the sum of the heat energy and mechanical or 
electrical energy, is constant. This is the Principle of the Conservation of 
Energy, formulated by Robert Mayer in 1842. Thus, in a closed system, 
one form of energy may be transformed into another, but the sum 
must remain constant. In the atomistic treatment of the theory of 
heat, we shall see that heat is a manifestation of the kinetic energy of 
the atoms and molecules of the substance, the difference between this 
and the evident kinetic energy of large bodies being the degree of 
organization. If, for example, a vessel of gas is in motion with a 
certain velocity, we attribute to all the molecules an observable 
mechanical energy— the common kinetic energy of motion of the centre 
of gravity. But in addition to this, there is the kinetic energy of the 
molecules, flying about at random and colliding with the walls, which 
we term heat. Since disorganization is the essential property of that 
part of the energy knojra as heat, oner can apply the concepts of the 
theory of heat only to systems composed of, many elementary par- 
ticles. A single atom cannot be said to have a temperature. 

If a quantity of heat represents a definite store of energy, there 
must be a definite relationship between the calorie and the erg. It has , 
been found experimentally (Joule) that 

1 calorie — 4-184 X 10 T erg; 1 kg. cal. == 4184 joules. 

In what follows we shall assume all energy quantities to be expressed 
in ergs, so that no conversion factors are required in the formulae. 

We can give another formulation of the First Law — one which is 
of a negative nature and which is important on account of its imme- 
diate intuitive nature and its analogy with the remaining laws. Since 
the store of energy of an isolated system is constant, it is impossible 
to construct a machine which does work without obtaining energy from 
an external source. This form of the energy principle is termed the 
601 
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theorem concerning the impossibility of constructing a perpetuum 
mobile of the first kind. 

We must obtain a mathematical expression of the First Law. To 
this end we introduce the total energy V as a function of the thermo- 
dynamic variables. Evidently this quantity must be a single-valued 
function of the independent variables, say of T and v, for otherwise 
it would be possible to construct a mechanism which creates energy. 
For example, if the 'energy content were greater when 1 litre of air is first 
warmed from 0° to 100° C. and then compressed to 10 cm. 3 than when 
the same amount of air is first compressed to 10 cm. 3 and then heated 
to 100° C., it would be possible to gain energy by bringing the substance 
back to its initial state by the second process, which requ»?es the 
smaller amount of energy. We must therefore have 

V = U(T, V) or U(T, p) or U(p, F). . (1) 

Hence the differential 




is the total differential of a function of the thermodynamic variables. 

In contrast with this, the total quantity of 
heat supplied to a system is not a unique 
function of the variables, for it is not neces- 
sary that the part of the energy of the 
system which exists in the form of heat 
be supplied as heat; it may be supplied in 
part, for example, as mechanical energy in 
compression. As a result, a small quantity 
of heat supplied to a system is not the total 
differential of a function of the thermo- 
Fi*. i dynamic variables; it is therefore denoted 

by 8Q instead of by dQ. 

The internal energy of a system may be increased by supplying 
energy from the outside, part as heat and part by doing mechanical 
or electrical work. If we take every quantity of energy supplied to 
be positive, regardless of what form it is in, the First Lawjs 

SQ + 8W = dU (3>k 

The commonest instance in which work is done is the change of volume 
accompanying an external pressure p. If every element dS of the sur- 
face experiences a displacement 8s, an amount of work 

SW = — £pdShs = — p^dSbs . . -. (4) 
is done. The negative sign is to be used, since energy is supplied to 
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the system if bs forms an obtuse angle with dS (cf. fig.-l). Now the 
integral (fids S s is merely the change in volume dV resulting from 
the displacement. Hence in this special case the First Law may be 

w&ten s q _ p d v = dU (5) 

or SQ = dU + pdV (5') 

This last form may also be interpreted thus: the energy supplied to 
system in the form of heat goes in part toward increasing the 
internal energy U and in part to performing external work. 

With regard to the pressure, the following may be added: If the 
outside pressure p on the freely movable piston of a gas-filled cylinder 
is equal to the gas pressure, the piston will remain at rest. If the ex- 
ternal pressure is much diminished to a value p', a sudden expansion 
will follow and the gas will acquire energy of flow which, however, is 
soon converted into heat because of turbulence. In this instance, then, 
the useful work obtained is smaller than if there had been only an 
infini tely small difference between inside and outside pressure, and 
hence an infinitely slow expansion. The latter condition would ob- 
viously yield the maximum external work, and we term the process a 
reversible one. Unless otherwise remarked, we shall assume reversi- 
bility for all processes discussed. 


2. Speoiflo Heat at Constant Volume and at Constant Pressure. The 
Energy Function of a Has. 

According to p. 487', the specific heat per gramme or per mol repre- 
sents that quantity of heat necessary to impart to one gramme or to 
one mol a temperature increase of one degree. One must differentiate, 
however, between the case where heat is supplied at constant volume 
and that where the pressure is held constant while the heat is sup- 
plied. In the first instance no external work is done, while in the 
second, expansion against the outside pressure does mechanical work. 
This distinction is important chiefly for gases, on account of their 
large coefficients of thermal expansion. If we apply the First Law to 
the energy supplied, the definition of specific heat per mol yields the 
equation 

8 q _ du 4- pdv 

C ~~dT If ' * * * 


(6) 


Now the energy u is a function of the thermodynamic variables, of 
which we take T and v to be the independent ones. We then have 
for constant volume 
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and the specific heat at constant volume, per mol, is 



In the second case v is not constant and 

(jr). dT+ W)M iv - cjlT+ [(i£) (9) 

We now express v in terms of T and p and obtain, for constant 
pressure, 

*=( 11 )" < 10 > 

Division by dT yields 

This equation takes a particularly simple form for ideal gases. In 
this case the energy function u{T, v) may be given explicitly. Since 
there are no forces acting between the molecules of an ideal gas, the 
energy must be independent of the distances between molecules, i.e. 
it must be independent of thewvolume of the gas. Hence, for an ideal 
gas 



This, together with equation (8), gives the energy function of an ideal 
gas as 

u = f\dT + u 0 . . (13) 

Jo 

Here tig is a constant of integration independent of T and v — the 
zero-point energy — which is set equal to zero in the classical thermo- 
dynamics. The fact that the internal energy of an ideal gas is inde- 
pendent of the volume is experimentally verified by the expansion 
experiment of Gay-Lussac. Two large glass bulbs connected by a tube 
fitted with a valve are immersed in a water bath which serves as 
a calorimeter. At the start one blub is evacuated, the other is filled 
with air. When the valve is opened, the pressures in the bulbs are 
equalized, but the water bath is found to experience no change 
in temperature, showing that the internal energy of the gas has 
not changed after the expansion. It is true that there occurred, 
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at first a transformation of heat energy (disorganized energy) 
into kinetic energy of flow (organized energy), but the latter was 
converted back into heat when the gas impinged on the walls of 
the originally empty bulb. In all, the heat energy of the gas remained 
unaltered. 

The equation of state may be expressed as 


Using this and (12) in (11) we have 

c 9 — o, — R (14) 

for ideal gases. This relationship is known as Mayer’s equation. 

For a real gas the internal energy is, to a certain extent, dependent 
upon the volume, for if there are forces of attraction between mole- 
cules, work is done against these forces during the expansion. The 
greater the intermolecular distances, i.e. the greater the volume, the 
greater is the potential energy corresponding to these forces. Using 
van der Waals* equation, it is easy to obtain an expression for the 
dependence of internal energy upon volume. If there is a small positive 
ohange of volume dv, an amount of work 


is done against the cohesive forces. Since the added term for real 
gases is always small, we set the internal energy equal to the sum of 
a term depending only on the temperature and one depending only 
on the volume: 

«=« 1 (T) +«,(*), ie. (ff) r =^T- • • < 16 > 

If no energy is supplied from without, the change of the second term 
is equal to the work done on the system: 

du t = d pdv= +dW = + - 2 dv, 


(du\ _ 
\dT/~ Cv ’ 

- fc v dT ~l + * 

Jo V 


( 18 ) 
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If the specifio heat is independent of the temperature (this is true 
about as far as 1000° abs. for all gases except hydrogen), then 

U = c^-|+u 0 (18') 

The constant Uq is subject to the same remarks as in the case of an 
ideal gas (p. 483). 

For a van der Waals gas we then have 

c,-e,.-(p + i) (|;) ; (19) 

Differentiation of the equation of the state gives 



Putting this value in equation (19) and using the equation of state, 
we obtain 

C * -C,,== , 2o(u-6)2* 

v*R'T 

or, replacing R' by R, we have approximately, since a and b are small, 

m 

for a van der Waals gas. 


3. Adiabatic Change. 

So far we have considered processes in which one of the primary 
variables T, p or v was held constant. Instead, a process may be 
characterized by holding constant any function of the thermodynamic 
variables. Adiabatic processes are of this kind. The function which 
is held fixed in this process is the entropy; this function will be dis- 
cussed in the next chapter. Adiabatic processes are, as the name 
indicates, those in which no heat is communicated to or abstracted 
from the system. This condition may be realized in practice in two 
ways : either the entire system is enclosed in a heat insulating envelope 
or else the processes take place so rapidly that there can be no appre- 
ciable heat transferred by conduction even when the heat insulation 
is imperfect. 

The First Law famishes at once a differential equation for the 
adiabatic process: 


du + fdv — 0. 


( 21 ) 
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If m is expressed in terms of p and v, this equation represents a family 
of curves^ known as the adiabatic lines or simply adiabatics. The 
equation of the adiabatics is especially simple for ideal gases: since 
we have 


iM~c.iT and 


and since R — 


(21) becomes 



Using the symbol y for the ratio c 0 /c, and integrating, there 
results 

pyv 3= p 0 v Q y — const (23) 


Since y > 1, the adiabatics are always steeper than the isotherms 
(jyf) = const.) in the pu-diagram. If we represent the relationship 
between T, p and visa rectangular system of co-ordinates, equation 
(23) represents the projection on the jw-pl&ne of a space curve lying 
on the surface pv — RT, Making use of the equation of state, we find 
that the other projections are given by 

Tvf~ l — T 0 V _i == const. .... (24) 
and Tp( 1 -r)/r= T 0 p 0 a-r)/i-== const. . . . (25) 

Thus y may be determined for an ideal gas from the adiabatio ohange 
in volume. A method which is" more general, since it is not restricted 
to ideal gases, is that which employs fixe velocity of sound in the gas. 
The relationship 

/modulus of elasticity ae 

W= V .... (26) 


given on p. 183 for longitudinal waves in solid bars is valid also for 
fluids if we use the volume modulus of elasticity (see p. 213). If we 
insert the numerical values of density and modulus of elasticity for 
air, which (Ex. 119, p. 500) is equal to the pressure, for an ideal gas, 
we obtain a value of v which does not agree with observation. Newton 
was the first to compute the velooity of sound in this way. The sourcje 
of the error in his result was pointed out by Laplace. It is not that air 
was incorrectly assumed to be an ideal gas; the discrepancy is of a 
much greater order of magnitude. The true reason is that the compres- 
sion take place so rapidly in a sound wave that the resulting heat does 
not have time to dissipate itself; hence the process is adiabatic, while 
the modulus of elasticity in the example, p. 500, assumes that the 
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temperature is constant. Now it is readily shown that the adiabatic 
modulus of elasticity is equal to y times the isothermal modulus. To 
prove this, take p and v as independent variables. The differential 
equation of the adiabatics is then, by (5) (p. 503), 

(j s)*+[(s). + ']*“*■ ' ' • m 

On account of equation (8) (p. 504), this gives 



The existence of an equation of state connecting the variables, say 
f(p, v , T) — 0, implies 

so that, as at (5) (p. 494) 


Titus we need only show that the bracketed expression in (28) is equal 
to c„ in order to prove our statement. If we now take T and p as inde- 
pendent variables, we have, by the First Law, at constant pressure, 


Thus (28) becomes 


‘•-(hD'+Kct)."- 
C '“H = *(!?) + (! i): ■ ■ • (30) 
C4-C4- « 
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and so, from the definition of E, 

E »d = Y E u • • (31') 


The value of the adiabatic modulus is obtained from the speed of sound, 
while static experiments give the isothermal modulus. Hence y may 
be computed. 

An adiabatic process which has found an important technical 
application in the air liquefying machine (Linde) is the so-called 
throttled expansion of a real gas. In the Gay-Lussac expansion 
experiment (p. 504) the gas in the first bulb attained kinetic energy of 
flow and was at the same time Cooled; the energy was again converted 
into heat, however, when the motion was arrested in the second vessel, 
so that within the somewhat large limits of error no net change in 
temperature was observed. The accuracy of observation is increased 
very much if the expansion is regulated so that no energy is converted 
into energy of flow. This is the so-called “ porous plug ” experiment 
of Joule and Thomson (Kelvin). With this arrangement it is not 
necessary to wait for the system to come to equilibrium; rather, the 
temperature of the expanded gas itself may be measured. Since this 
gas has received no appreciable kinetic energy,, its temperature is a 
measure of the change of internal energy accompanying the increase 
in volume. The apparatus is represented diagrammatically in fig. 2. 

By continuous motion inward 

of the piston 1 and continuous , EL ^ "1 1 />. |j 2 , 

withdrawal of piston 2 the pres- H ! J D I — h 1 

sure Pi is maintained on the left, Fig- * 

the pressure p 2 on the right. In 

the original experiments the throttle D was a plug of cotton or silk. 
If one mol of gas is forced through in this manner, the amount of work 
done on the left is pjV lt while the gas has done an amount of external 
work p 2 v 2 on the other side. The increase in the internal energy of 
this mol of gas is then the excess of the applied work over the work 
obtained from the system: 

«* — «i — Rth — PM 

or «a + PM = «i + Pi v i (32) 

In this process, then, the “ heat function s ’ or enthalpy, or total heat 
h—u-\-pv remains constant. Sines each term is a function of the 
thermodynamic variables, the enthalpy is also a function of these 
variables. The change in temperature AT accompanying a change in 
volume Av is readily given for a van der Waals gas: 

A=u 0 + Cv T-?+ 


. . (33) 
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whence it follows that 

Neglecting b as compared with v and making the further approxi- 
mation of replacing R' by R and o v + R by c„, we obtain, after solving 
for AT: 


A T = 


£rJ RTb ~ 2 


(35) 


In this experiment the temperature difference that arises repre- 
sents the difference between two opposing processes. Expansion 
makes the attraction between molecules produce cooling, since work 
most be done against these forces. On the other hand, the space occu- 
pied by the molecules, which is characterized by 6, is responsible for a 
warming effect when the gas expands. 

Of greater importance is a knowledge of the change in temperature 
as a function of the pressure change A p. To obtain an expression for 
v in terms of p which is accurate to the same order as (35), we may 
replace v by the equivalent expression RTjp from the ideal gas equa- 
tion and obtain 

AT s 


« 

Since A p < 0, there will be a decrease in temperature as long as 




The numerical values for air are such that a considerable cooling^ 
effect is obtained even at 0° C., and this effect is employed in the Linde 
machine to cool the following masses of air. For hydrogen, however, 
there is a temperature rise when the exit temperature is 0° C., hence 
this gas must be pre-cooled with liquid air. 

In reality, the approximations made above are quite crude in the neighbour- 
hood of the liquefaction point. If (34) is solved for AT without neglecting any 
terms, the factor of Av when set equal to zero no longer yields a fixed transition 
temperature but an equation connecting T and v — a transition curve. This shows 
that even at lower temperatures there are pressures at which the sign of the 
effect reverses (see following exercise). 

Ex. 120. Find the transition curve in terms of reduced variables. 


4. Application of the First Law to Thermochemistry. 

In all chemical reactions we observe a heat exchange, i.e. the 
reaction is accompanied either by the evolution of heat — a so-called 
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exothermal reaction — or heat is absorbed in the process — an endo- 
thennal reaction. Most chemical changes are of the former type. 
If the reaction takes place in a calorimeter, the heat of reaction may 
be measured. Contrary to our former practice of counting heat sup- 
plied to a system as positive, the heat of reaction in thermochemistry 
is taken to be positive for exothermal reactions. This will be expressed 
by representing the reaction heat by Q~ — —Q\ Q~ is measured in 
thermal units (calories), and is read “ Q minus ”. 

If the reaction takes place without doing external work (i.e. for 
gases, at constant volume), the heat of reaction gives the change of 
energy of the system resulting from the reaction. On the other hand, 
if the process takes place at constant pressure, then — practically 
only in the case of gases — the external work must also be taken into 
account. If the volume of the system is F x before the occurrence of 
the reaction and F 2 after, this work amounts to p{V 2 — F x ). Desig- 
nating the two heats of reaction by the subscripts v and p, as done 
above for the specific heats, we have, at constant volume, 

= U l — U s (37) 


At constant pressure, 

Q-'-Ut-Ut-piVt-VJ 
or Q-,= U 1 +pV 1 -(U a + pV a ) = W 1 -W t 


(38) 


Thtis the heat exchange at constant pressure gives the change in 
enthalpy H before and after the reaction. Since it is sufficiently 
accurate to consider the gases ideal in thermochemistry, we have 


pF x = r^RT 


and so Q~„ — Q~, + (n^ — n^RT — Q~, -f A nRT. . (39) 


where An is the difference in the number of mols of gaseous com- 
ponents of the system before and after the reaction. 

For reactions at constant pressure, the enthalpy is the same »..» the 
energy content; moreover, in no case do these two quantities dider 
much. We adopt the following notation for thermochemical equations: 
The enthalpy H ol mol of a substance in the liquid state is de- 
noted by the chemical symbol written without brackets, for the solid 
state by the symbol in square brackets, and for the gaseous state in 
parentheses. Thus the thermochemical equation 

[Pb] + [£] = [P6(S] + 18,400 cal 


or [Pb] + [5] — [PAS] = 18,000 cal 


means that one mol of solid lead and one mol of solid sulphur together 
have an energy content 18,400 cal larger than one mol of lead sulphide. 
The main significance of the First Law is that energy content 
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and enthalpy are functions only of the thermodynamio variables; 
hence the energy differences must always be the same, whether the 
reaction consists of a single process or involves intermediate steps. 
This fact was expressed by Hess before the First Law was known. 
Hess’s Law enables us to combine thermochemical equations; this 
often makes possible the calculation of the heat of reaction in cases 
where it would be extremely difficult to measure, or for reactions which 
are purely hypothetical. In organic chemistry, for example, the heat 
of formation of a complicated compound is determined by combustion, 
first of the compound and then of the constituents. The combustion 
usually takes place in a bomb calorimeter at constant volume. In 
order to show that this method yields a sufficient number of equations 
for calculating the heat of formation we consider the formation of 
liquid benzene from solid carbon and gaseous hydrogen, the process 
occurring at constant pressure. From the reaction in a bomb calori- 
meter (constant volume) we find 

C 6 H 6 + ¥(0 2 ) — 6((70 2 ) - 3 H 2 0 = 781,000 caL 

Since the data on the remaining reactions refer to constant pressure 
conditions, we convert the above reaction to constant pressure also. 
In the process under consideration, 3/2 mol of gas vanishes, and since 
R is approximately equal to 2 cal, we must add 900 cal to the right 
member of the preceding equation, assuming room temperature to be 
about 300° abs. This quantity of heat is relatively small compared 
with the total amount, showing — as indicated above — that no great 
error is involved in replacing Q~ P by Q~ v , especially since the heat 
of reaction cannot be measured with any great accuracy. In addi- 
tion, we have the following data for the combustions at constant 
pressure: 

3(H 2 ) + |(0 2 ) - m % 0 = 205,200 cal 
and 6[C] + 6(0 2 ) - 6(C0 2 ) = 566,400 cal. 

Increasing the right member of the first of these three equations by 
the required 900 cal and subtracting it from the third we obtain 
6[C] - f(0 2 ) + 3 H z O - C 6 H a = — 215,500 cal. 

Adding the second equation to his, 

6[C] + 3 (H a ) — C 6 H 6 = — 10,300 cal. 

Thus the heat of formation of benzene from carbon and hydrogen is 
negative — energy must be supplied to make the reaction occur. It 
must be mentioned, however, that the heat of formation is not the 
correct criterion for the occurrence of the process, although in general 
— as in the present instance — exothermal reactions are easier to pro- 
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^ ttce are endothermal. This is known as Berthelot’s Principle. 

The First Law of Thermodynamics enables us to find the law of 
dependence of heat of reaction on temperature. For a reaction taking 
place at constant volume, 

»'aO+ *-66 + ... = v,fi+ v d $ + . . . , . . (40) 

where v b , . . . are the numbers of mols, and a, b, . . . are the formulae 
of the substances entering into the reaction. At a temperature T, 
(v.o)r+ (*'»&),•+• ...—(VeC),— (v d d) r — ... = q~ ,(T), (41) 
and at a temperature T -f- dT, 

( v a a ) T+il + (*'»&)*+«• + • • • (»'«C), +4 , (v d d) r+dT — ... 

-q-,(T)+^dT (42) 

Now at constant volume 

du = c v dT, ....... (43) 

so that 

(>'«a)f+ v a c„dT + (v 6 6),4- v b c, b dT + . . . 
v e c V0 dT— Mr- dT — ... = q~ v (T) + %fdT, (44) 
or — 2 v t c vt . (44') 

In this summation, the number of mols is to be counted positive before 
the reaction and negative afterward. Exactly the same equations 
result for reactions at constant pressure, only it is necessary to re- 
member that tho heat is supplied at constant pressure, so that c» is 
to be replaced by c p : 

^ = 2^ (45) 

Over a small temperature interval, c„ or c„ may be considered constant; 
this is no longer allowable when large intervals are considered. For 
example, if we wish to introduce the heats of reaction at absolute 
zero instead of those observed at room temperature, it is necessary 
to take 

r.(^) = ST* + 2v, f'c w dT and q~ v {T) = ?-*, + Ev, f'c pi dT. (46) 
Jo Jo 

These integrals may be evaluated if the variation of the specific heats 
is known. 
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CHAPTER XXXI 


The Second Law of Thermodynamics: The Law op 
Entropy 

l. The Carnot Cycle and the Ideal Heat Engine. 

The First Law of Thermodynamics makes a statement concerning 
the energy balance which must be maintained in any transformation 
between mechanical or electromagnetic energy and heat. This law* 
however, does not tell whether or not this transformation will take 
place under given conditions, nor does it say what portion of the 
energy will be transformed. Even before the First Law was discovered 
it was known that any amount of mechanical energy could be com- 
pletely converted into heat at will, e.g. by friction, but only a partial 
conversion of heat into mechanical energy could be realized with the 
aid of heat engines. By studying these relationships, Carnot * was 
able to represent the processes taking place in a heat engine by means 
of a cycle which, as we shall, find, represents the best possible heat 
engine. This ideal machine makes use of two large reservoirs of heat 
which are at temperatures T x and T z respectively, with T x > Tj.f 
The working substance may be taken to be one mol of an ideal gas 
enclosed in a cylinder equipped with a frictionless piston. By means 
of a frictionless guiding mechanism of some kind, the cylinder may 
be placed in thermal contact with each of the reservoirs in turn, or 
may be completely insulated from them. We provide, further, that 
the processes which the working substance undergoes be reversible. 
By this we mean the following. If the gas is allowed to expand by 
decreasing the external pressure, this must not be done by a sudden, 
finite change of pressure, but in such a way that the external pressure 
differs only infinitesimally from the internal; thus the piston receives 
no appreciable kinetic energy which would require to be removed by 
braking. Under such conditions the process may be carried out equally 
well in the reverse order, provided that the external pressure is a little 
greater than the internal. In this case the work done differs only in 
algebraic sign from that done in the expansion process. It is evident 

* S. Carnot, Reflexions sur la puissance motrice du feu, et sur les machines propres 
i ctevelapper cette puissance, Paris (1824); Eng. trans. by R. H. Thurston (1890). 

t These heat containers are sometimes referred to as “ source " and “ sink " or 
as “ boiler ’’ and " condenser ” respectively. 
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that the rapidly ’ ".inning engines used in practice are very different 
from the ideal Carnot mechanism. 

A cyclic process or cycle is a process in which the machine is again 
brought back to its initial state; this is a necessary property of a 
machine that is to run arbitrarily long. Let us cause the working 
substance to pass through the following cycle (fig. 1); 

2 


Fig. J 



1. Connect the cylinder with the source of heat at temperature T t 
(the boiler) and reduce the external pressure (e.g. by removing weights 
from the piston), thus allowing the gas to expand. In doing so, the 
gas does work against the external pressure and the equivalent amount 
of energy is taken from the boiler. Since we assumed the boiler to be 
very large, we need not consider its temperature drop during the pro- 
cess and so may take the change in volume of the working substance 
to be isothermal. If the gas expands from the volume to ti 8 the 
external work done is * 

W - lt = £pdv = RT,£ ^ = RT, log^j. . , (1) 

2. The cylinder is now removed from the boiler and insulated. 
The gas is then allowed to expand adiabatically until its temperature 
has dropped to T a , that of the condenser. By equation (24) (p. 807) 
the volume v 3 at that temperature is given by 

( T \ 1/(1— y) /r 

&) -%(£) — ( 2 ) 


since T a — T v Thus the work done in this adiabatio 

“* dv 


= j Cpdv = p 2 Vjf* ^ 


1-y 


(Ugi-y— t, 2 i-y) = 


. m 


* As in the case of quantity of heat, the work done on a system is reckoned positive. 
When, infrequently, we count the work done by a system as positive we denote it by 
W~ (“ W minus ”), i.e. W~ — - W. 
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or, by (2), this is equal to 


RT 2 
1 ~y 


/£?_ ]Y 

\t 2 ) 


~ (T 2 - T s ) = JL (T, - T 3 ). 


3. The cylinder is now placed in thermal contact with the con- 
denser (at temperature T 3 ) and the gas is isothermallv compressed to 
a volume v t such that if it be adiabatically compressed from this point 
to the initial volume tq it attains the initial temperature T v This 
volume is given by 


/ T \ua-y) /nn 


. w 


since T t = T a . The work done in this compression is negative, since 
it is really supplied to the system. Its magnitude is 


W~ u = J 'pdv— RT Z log^~. . ..(d) 

The heat generated is, delivered to the condenser. 

4. The cylinder is removed from the condenser, is thermally 
insulated and compressed adiabatically to v v An 'amount of work 

W~ u = Qpdv = (T t - Tj) = (T 3 - Tj) . (6) 


is thus done. The same remarks apply to the algebraic sign as in the 
third phase of the process. - 

The machine itself is now in its initial configuration. A certain 
quantity of thermal energy has been abstracted from the boiler (tem- 
perature f\) and delivered to the condenser (temperature T 3 ) and, in 
addition, a certain amount of mechanical work has been done. This 
is given by the sum 

JP~12 + 1P~23 + ^~84 + ^~41 » 

and is represented in the so-called indicator diagram (fig. 1) by the area 
enclosed by the path covered in the process. Hence external work is 
done when the path is traversed in such way that the enclosed area is 
always on the right, as is evident at once by considering the expansion 
from Vi to v 2 , ixp which the work done is represented by the area be- 
tween the curve p = /(«), the axis of abscissae, and the lines through 
points 1 and 2 drawn parallel to the p-axis. Adding the amounts of 
work done in the individual steps, one sees at once that the amounts 
along the adiabatics cancel out and there remains 

W- = RT X log ~ 2 + RT 3 log -< 
v t v z 


( 7 ) 
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« 4 _ 
v 3 v 2 

. . .- (8) 

Hence 


W~ — R (F 1 — T 3 )log . . 

. . . (9) 

A quantity of heat * 


Qx = RT X log . . . . 

. . (10) 


equivalent to the work done in the first step of the j®ocess, was 
removed from the boiler and delivered to the engine. Of this the 
fraction 



was converted into useful mechanical work, while the quantity 
of heat 

Q-, = RT 3 log | = Q 1 (1 - 1) . • (12) 

was returned to the condenser. The fraction ij is called the efficiency 
of the heat engine. Since all the steps of the process are reversible, 
the engine may be used also to extract a quantity of heat Q„ from the 
condenser, delivering it to the boiler. This requires the expenditure 
of an amount of work 

W=vQ~ (13) 

1 — ij 

The efficiency of the Carnot engine is the highest attainable by any 
means. To show this, assume that there exists a “ super-Carnot engine ” 
with still higher efficiency rj. By coupling this machine to a Carnot 
engine we would have a heat engine which — while it would not be 
a perpetual motion machine, since it does not violate the First Law — 
would nevertheless be equivalent to one from the practical standpoint. 
Such a mechanism is called a perpetuum mobile of the second kind. Let 
the super-Carnot engine take a quantity of heat Q x from the boiler, 
converting 5) Q x of this into useful work and returning (1 — ri)Q x to 
the condenser. This quantity of heat, however, may be recovered 
immediately by means of a Carnot engine; by equation (13), this 
requires an amount of work 
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In all, the combination has done an amount of work 

w-„ = W. - j— ^ r,Q* = . . (14) 

1 — V 1—7] 

which is positive. The thermal equivalent of this work has been -ab- 
stracted from the boiler without the delivery of any of it to the con- 
denser, which therefore is not required at all. This means that the 
mechanism would do mechanical work and would cool the boiler 
without causing any alteration whatsoever in the condition of any 
other body. Since enormous sources of heat are available (e.g. in the 
oceans), a machine of this kind would be fully as important as a per- 
petual motion machine. Experience shows, however, that it is im- 
possible to construct a machine of this kind, and this is precisely what 
the Second Law of Thermodynamics says. 

Before we formulate this law, it must be mentioned that a satis- 
factory definition of the temperature scale is given by equation (11) 
(p. 517). Thus, in order to compare the temperatures of two sources 
of heat, we use them as boiler and condenser of a Carnot engine and 
determine its efficiency. The condenser is at the absolute zero of tem- 
perature if the efficiency is unity. This defines the thermodynamic or 
Kelvin temperature scale, which is identical- with that based on the 
equation of state of an ideal gas, as is evident from the derivation. By 
means of suitable thermodynamic reasoning and measurements, any 
thermometer may be reduced to the thermodynamic scale. However, 
the details cannot be given here. At not too low temperatures the 
readings of a helium thermometer agree with the thermodynamic scale. 

Ex. 121. Is the heat developed in electric resistors an- economical way of 
heating a building! 

2. 'Formulation and Interpretation of the Second Law. 

In conformity with the result of the previous section we give the 
Second Law the formulation due to Thomson and Planck: 

It is impossible to construct a continuously operating machine which 
does mechanical work and which cools a source of heat without 'producing 
any other effects. 

This is a fact of experience, just as is the Eirst Law, and can be 
confirmed or disproved only by experiment. We shall find that the 
consequences of the Second Law are exceedingly numerous and that 
the experimental confirmation of these consequences is at the same 
time a verification of the validity of the Second Law, which, moreover, 
has a logical interpretation from the atomistic point of view. 

The first consequence is that certain energy transformations in- 
volving heat are irreversible. case in point is the conversion of work 
into heat by means of friction. If, for example, a vessel containing 
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a rotating liquid is left to itself the liquid will soon come to rest, since 
its kinetic energy is soon converted into heat; this causes a slight 
temperature rise which may be measured. It would be an event 
absolutely unheard of, however, if a liquid at rest were to start rotating, 
its temperature dropping at the same time, although this would not 
be a violation of the First Law. If the process opposite to the genera- 
tion of heat by friction existed, it would have exactly the property of 
a perpetuvm mobile of the second kind. If, then, we assume the Second 
Law to be valid, such a process does not exist. 

Another irreversible process is the isothermal expansion of an ideal 
gas without the performance of external work. This is the process we 
met in the Gay-Lussac experiment (p. 504). If it were possible to com- 
press a gas isothermally without using external work, we could construct 
a perpetuum mobile of the second kind by allowing a gas to expand 
isothermally and do external work, the required energy being taken 
from a reservoir in the form of heat; then, using the previously men- 
tioned process, the gas could be compressed again to its original volume 
without requiring any work. 

Expansion without external work is represented also by the dif- 
fusion of two gases. If we remove a partition separating two gases 
in a vessel, each gas ultimately fills the entire vessel, since the entire 
space is available to the molecules of either gas. According to the 
Second Law, this process cannot be reversed without doing work. In 
this process and in the one discussed in the preceding paragraph heat 
enters only indirectly, since no changes in temperature occur. Never- 
theless, the phenomenon of diffusion throws light on the nature of the 
Second Law. We may represent the circumstances by a model as 
follows. Place a large number of white balls, representing the mole- 
cules of the first gas, in the bottom of an urn. Above these place a 
layer of coloured balls, representing the molecules of the second gas. 
If the urn is now shaken, the two kinds of spheres will mix. It is im- 
probable to the highest degree — and the improbability increases with 
the number of balls used — that the shaking will again produce a con- 
figuration in which all the coloured balls are again on top. To be sure, 
there is no physical law forbidding such a separation, just as there 
is no real reason why a number of pieces of type thrown at random 
from a box should not arrange themselves so as to spell out a given 
sentence, but the Theory of Probability shows that such a “ self- 
arrangement ” is practically out of the question. The remarkable one- 
Bidedness of the energy changes mentioned above is to be interpreted 
atomistically by the fact that the change takes place from a state of 
order to one of disorder, but not the other way. 

We now endeavour to give the Second Law a mathematical formu- 
lation from which further consequences may be deduced mathemati- 
cally. The First Law characterizes the energy U which depends only 
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upon the thermodynamic variables. Is it also possible to find a charac- 
teristic function for the Second Law which has this property? The 
Carnot cycle gives us a suggestion. After completion of the cycle, 
i.e. when the system is again in its initial configuration, we have by 
equations (10) and (12) (p. 517), 


Q 1 _Q is 
T x T s 


° r ! 


( 15 ) 


We wish to generalize this relationship so that it applies to any cyclic 
process. Imagine such a process carried out with an arbitrary system. 
The cycle is represented by a closed curve in the pT-diagram; the 
direction in which this curve is to be described is that corresponding 



to external work done by the system. Imagine, now, the pV - plane 
to be covered by a fine network of adiabatics and isothermals (fig. 2). 
If the network is taken arbitrarily fine, the work done may be repre- 
sented by the area enclosed by the zigzag line instead of that bounded 
by the curve itself. The former path is traced by moving from a point 
P 0 of the curve to a neighbouring point P x along an adiabatic as far 
as the intersection with the isothermal through P 1} and then along this 
isothermal to P lt instead of moving directly from one point to the 
other along the original curve. 

We specify, in addition, that all changes are to be reversible in 
the sense used in connexion with the Carnot cycle. Further, assume 
that the heat supplied or obtained along the isothermals is not ex- 
changed with the boiler or condenser directly, but that the work 
which can be got from it is really obtained with the help of a Carnot 
engine operating between a body at temperature T 0 and the system 
itself. The Carnot engine delivers to the boiler the heat generated 
along the isothermals, external work being used up in the process. 
Since only a small segment of each isothermal is traversed, the indi- 
vidual quantities of heat supplied to the system are small and so are 
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denoted by A Q a . The, quantities of beat wbicb the Carnot engine 
abstracts from the boiler at temperature T 0 are denoted by A Q~ 0 . 
Part of this heat is converted into mechanical work, while A Q a is 
delivered to the system in question. By equation (12) (p. 517), 

AG„ = AQ- 0 .^ or AQ- 0 = A&|?. . . (16) 

•It remains to show that the quantity of heat which, -must be sup- 
plied along the zigzag path is the same as that supplied along the 
actual curve. To this end we imagine a cycle performed along one 
of the infinitesimal triangles bordering the curve. The work done is 
given by the area enclosed by the triangle, and this vanishes to a 
higher order than the lengths of arc of the curve as the network is 
made finer. Since no heat is supplied along the adiabatic, the sum of 
the heat quantities supplied along the isothermal segment and the 
actual curve must be zero if all sides of the triangle are traversed in 
a given direction; if both segments are traversed in the same sense, 
the two quantities of heat mhst be equal.' 

After completing the cycle, the system under consideration and 
the Carnot engine are back in their original configuration, and sum- 
mation over all heat exchanges gives 

0r . . • (17) 

In the limit, when the segments are very small, the sum is replaced 
by the integral 



so that for a closed path the relationship 

or • * • (18) 

is valid. If Q~ 0 is negative, this means' that .heat is being supplied to 
the boiler at the expense of mechanical energy. This is entirely 
possible. On the other hand, a positive value of Q~ 0 would mean 
that heat is taken frori the boiler and converted into mechanical 
energy either in the Carnot engine or in the system under consideration. 
But this would be a perpetuum mobile of the second kind. In the most 
favourable case, i.e. for complete reversibility, Q~ 0 is zero. In this 
instance, the Carnot machine uses just as much work as was done by 
the system in its cycle. -Hence, for a reversible process between two 
states, the integral 
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must be independent of the path, for if two curves connecting the 
states 1 and 2 are joined to form a closed cycle, the integral must 
vanish. It follows that, when the initial state 1 is assigned, the 
integral defines a quantity which is a function only at the final state 
2. The integral 

reckoned from an undetermined initial point, is called the entropy 


8 of the system: 

(19) 

Thus, for a reversible passage from state I to state 2, 

5 ,-^= ( 20 ) 


For an irreversible change the Bign of equality (=) is no longer appro- 
priate. Whether it is to be replaced by > or by < is determined by 
supplementing the process by a reversible one in the opposite direc- 
tion, the whole process being cyclic. Since the cycle contains irre- 
versible steps we must have, according to the statements immediately 
following equation (18), 



or .... (21') 

The positive difference 

is then a measure of the irreversibility of a given process. It is easy 
to see, in special cases, why the value of the integral 



decreases for irreversible cyclic processes: Suppose we supply heat 
to the system by means of a Carnot engine, but, on account of 
friction, only a small part is converted into mechanical energy, "or 
he&t due to friction is generated in the system itself. Then, in order 
to bring the system back to the initial configuration, more heat 
must be withdrawn from the system than in the case of a reversible 
cycle. 



XXXI.] THE LAW OF ENTROPY 523 

For isolated systems, in which there is no exchange of heat with 
the environment, the integral 

rdQ 

J T 

is zero; thus, in any change, 

S % — $1 > 0. 

All changes unthin an isolated system proceed in such way that the entropy 
increases; hence changes will occur in such a system until the entropy 
has reached its maximum value. 

Thoughtless application of dS — dQfT to situations involving no 
heat exchange as such may erroneously lead to the conclusion that 
dS — 0. Let us illustrate what is 
meant, using the example of the 
diffusion of two gases mentioned 
earlier. Imagine equal volumes of 
the two gases, one above the other. 

When the separating partition is 
removed they will diffuse through 
each other and nothing else will 
happen. This process, occurring 
spontaneously, is irreversible. In a 
less simple way,. but reversibly, the 
same final state can be reached by 
making use of the fact that cer- 
tain membranes are semi-permeable. 

Rubber, for example, is readily permeable by carbon dioxide but not 
by hydrogen. For a gas that is readily transmitted by the membrane, 
the latter simply does not exist. Let the two gases now be placed in 
vessels which telescope one into another. The lower vessel ha3 a lid 
that is permeable by the gas in the upper vessel, the upper one has a 
bottom that ireely transmits the gas present in the lower vessel (fig. 3). 
The entire apparatus is in a vacuum. Thus no work is required when the 
vessels are moved together. When the two containers have been fully 
telescoped the gases are mixed, but the volume is half that occupied 
when they were mixed by irreversible diffusion. If we wish to attain 
this condition reversibly we must allow the mixture to expand to 
double its volume, in which process it does external work, but we 
must supply the equivalent amount of heat to the mixture in order to 
keep the temperature constant. This quantity of heat, divided by T, 
gives the change in entropy in the process of irreversible mixing. For 
the state attained by the reversible process, we return to initial con- 
ditions in this way: Using the work done in the expansion we compress 
the mixture, return the resultant heat to the heat reservoir and separate 
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the gases by means of the selectively permeable membranes with- 
out any expenditure of work. 

Entropy is a quantity which, like temperature, may be specified numerically, 
usually in calories per degree. The practice of giving numerical values for this 
quantity is less common because earlier there was no fixed zero point for its 
reckoning. One thus avoided such statements as “The body has a temperature of 
100° and an entropy of 300 cal./deg.” However, since the formulation of the 
Nernst heat theorem (p. 666), a definite reference level for entropy is indicated 
and such specifications are quite in order. 

Ex. 122. Calculate the change in entropy occurring when 10 gm. of water 
at 100° C. are mixed with 20 gm. of water at 16° C. 

Ex. 123. In how many points can amadiabatic interseot an isothermal! 

Ex. 124. What determines the position of the line BC in fig. 1 (p. 488)! 


3. Conditions of Equilibrium for Systems under various Conditions. 
Thermodynamic Potentials. 

Let us write (21') in differential form and express dQ in terms of 
the First Law, which, of course, holds whether we are dealing with 
reversible or irreversible processes: 


dV + 5IP- 
T 


(21") 


where W~ represents the external work, and the sign of equality applies 
when there is reversible conduction of heat. The notation SIF~ indi- 
cates that the external work need not be a perfect differential of the 
thermodynamic variables. In the reversible case, for mechanical work, 
8 W~ amounts to -\-pdV, where p is the equilibrium pressure — the 
pressure on the system at any instant. This furnishes us with a de- 
finition of entropy in terms of the thermodynamic variables in the form 
of a differential equation: 

'• <» 


In order to compute the state of equilibrium that a system will 
assume under given external conditions, we proceed as in mechanics 
by using the principle of virtual displacements. For a closed system 
characterized by the auxiliary conditions U = const, and V == const., 
Slf- is also zero, and we have 

SS^O. 


with the auxiliary conditions 

817 = 0 and 8F = 0. 

This merely repeats the proposition, given above, that an isolated 
system in equilibrium assumes a condition of maximum entropy. 

In general, however, the systems we deal with are not closed. Most 
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laboratory processes, for example, proceed at constant temperature 
because of heat exchange with the surroundings. For T = const, and 
a reversible process, (21") yields 

dW~ = -d{U -TS) = -dF, . . . (23) 

The function F is called the free energy. At constant temperature its 
decrease gives the maximum obtainable external work. Thus the free 
energy has a very vivid interpretation: Of the entire energy content, 
a part is available for any transformation, while the remainder is fixed 
and is converted into heat. In general, according to (21"), 

(24) 

Now if the volume does not change (isothermal-isometric system) no 
work is done, and on account of 8 W~ = 0, , 

0<£-8 F 

or 8F <i 0, I 

with the auxiliary conditions [ (25) 

ST = 0, 87 = 0. J 

At equilibrium, an isothermal-isometric system has its minimum value 
of the free energy. Equilibrium is characterized by the sign of equality, 
because changes will take place spontaneously until this minimum 
value is reached and — like any extremum— this is characterized by 
SF — 0. 

If the pressure in an isothermal system is held constant, then for 
a reversible process 

SF- = pdV = -d(U - TS), 

or since p — const., 

d(U — TS + pV) = dG = 0 (26) 

The function G introduced here is called the free enthalpy, sometimes 
the Gibbs potential. It differs from the free energy by the circumstance 
that the enthalpy appears in place of the energy. According to (24), 
in any isothermal system 

8F- ^8 (TS- U), 

but since 8F~ <i S(pF) and p = const., we have 
S(pV) ^ 8(TS - U), 

or 8(17 — TS -I- pV) = 8G^0, 

with auxiliary conditions 

8T = 6, 8p = 0. 


. . (27) 
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An isolhermal-isobaric tends towards a minimum value of the free en- 
thalpy, and this value characterizes the equilibrium. 

The functions introduced here are called thermodynamical potentials. 
The name comes from their formal analogy with the potential in 
mechanics. In the same way as the components of force are obtained 
by differentiation of the mechanical potential with respect to the 
co-ordinates, so all other thermodynamic variables are obtained" by 
differentiation of the thermodynamic' potentials. This makes un- 
necessary the introduction of new constants of integration which 
might give rise to difficulties. The potentials must be given as func- 
tions of two properly selected independent variables. The appropriate 
variables are those which, together with the variation of the potential 
in question, occur in the auxiliary conditions. The variables appropriate 
to each potential are as follows: 

Potential* Independent Variables 

Entropy S V, V 

Tree Energy F T, V 

Gibbs Function 0 T,p 

We then find the following relationships: From S — S(0, V) it 
follows that 

Comparison of coefficients gives 

K“)z K4); • • • • « 

In the same way, it follows from F = F{T, V) that 

dF = d*r + (* ^jdV — dU — TdS - Sdl, 

which, by equation (22) (p. 624) is equal to 

dV - dV - pdV - SdT. 

Comparing coefficients. 



* The notation used here for the potentials is largely that of Clausius. Gibbs called 
these functions ij, 1/1, and J in tbe above order. He also denoted the internal energy 
by c and the absolute thermodynamic temperature by t. See J. W. Gibbs, Collected 
Works, 1, New York, Longmans, Green & Co. (1928). 
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Also, from G{T, p) we find 

iff “ (If) dT + (&) d P = dU - Td8 ~ SdT + P dV + Vdp ' 


which is equal to 


— SdT + V dp 


by equation (22). Hence 

The relationships derived from H = H(S, p) are less important. It 
follows from equations (31) and (32) that the fre.e energy and the 
Gibbs Potential satisfy the differential equations 


(31) 


f = u + t Ktt); 


(32) 


G=U + pV+T( 11^. (33) 

These are obtained by subsStuting the expressions for S in the 
equations defining the functions F and G. 

All expressions entering into these thermodynamic potentials are 
such that they merely add when two systems are placed in contact 
and allowed to come to equilibrium. For this reason the potentials 
themselves are additive quantities. In particular, if the mass of a 
system is multiplied by n, the values of the potentials increase by this 
same factor. 


4. Connexion between the Internal Energy and the Equation of State. 

The Second Law is capable of giving a purely thermodynamic 
derivation of the dependence of the energy u on T and v — a connexion 
which was obtained in the previous chapter (p. 503 ) from partly kinetic 
considerations. If we take T and v as independent variables, we have 
for one mol: 


*-(»)"+( so* 

■Kfe)." + (£)* + H- • • (34! 

. Comparing coefficients, 

/3s\ 1 /du\ 

(dT// T(dT/ v ’ * ' 

©.-MO-M 


• (36) 
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If we differentiate (35) with respect to v and differentiate (36) with 
respect to T, comparison gives 

(nx-ai).— • • <*> 

Using the equation of state of an ideal gas, this relationship leads to 
the independence of the internal energy and the volume; if the van der 
Waals equation is used, we obtain equation (18) (p. 505). From equation 
(11) (p. 504), we obtain the general relationship for the difference of 
the specific heats: 



Ex. 125. In a voltaic cell whose electromotive force is 4> an amount of work 
Orfe is done when a quantity of electricity de passes. The expression <bde thus 
appears in place of the mechanical work pdv. The energy change is determined 
by the fact that an amount of heat equal to the heat of reaction per gramme equi- 
valent is generated when 96,496 coulombs of charge have been transported. 
What relationship exists, between the heat of reaction, the electromotive force, 
and its temperature coefficient? Hint: Introduce the charge e and the tempera- 
ture T as independent variables. 

Ex. 126. Show that for T and p as independent variables the enthalpy h takes 
the place of u and that for (2h/dp) r we get a formula analogous to (37). 

5. Electrocaloric and Magnetocaloric Phenomena. 

In our considerations of energy relating to electrical phenomena 
we did not take heat energy into account. Strictly* speaking, we would 
be concerned with the quantity designated there by u — the free energy; 
but since free energy and true energy differ but little at room temper- 
ature (they are identical at absolute zero), little error is involved. It is 
only when we "study the thermal effects connected with polarization 
that this difference must be kept in mind. 

As a starting point we use equation (36'), p. 290 , according to 
which an amount of energy E dV is supplied to each cubic centimetre of 
a body when its polarization, is changed. If, in addition, heat is sup- 
plied (but not mechanical- energy), then the First Law prescribes that 
du = 8q + EdP (39) 

We are able to work with merely the magnitudes of the vectors, since 
P and E have the same direction. Comparing this formula with the 
one that holds for mechanical work, we see that — E appears in place 
of p and P replaces v. Inasmuch as the polarization P usually refers - 
to unit volume, the formula as it stands is valid for a single cubic centi- 
metre of material. However, it may be thought of as applying to one 
mol if we interpret P to be the polarization per mol. The same equation 
is valid for the magnetic case also. In order to avoid confusion with 
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other quantities, let tlje magnetization per mol be denoted by l. Since 
the magnetic case is of much greater practical importance, the further 
development will be given in terms of these phenomena. If the me- 
chanical quantities in the equations of the last section are replaced 
by the magnetic ones, we obtain at once the difference of the specific 
heats at constant magnetization and at constant field. To interpret 
the result, consider a paramagnetic substance placed between the 
poles of a magnet and warmed. This lowers its paramagnetic sus- 
ceptibility and so, at constant field strength, its magnetization. In- 
stead, however, we could hold the magnetization constant by increas- 
ing the field. The detailed computation is not difficult (cf. Ex. 127, 
p. 530). 

Of considerable experimental importance in recent years is the 
temperature change accompanying adiabatic processes, because by 
their use we are able to approach closer to the absolute zero than by 
any ojher means. Since adiabatic processes are characterized by 
constant entropy, we must write down this function for a magnetized 
body. Transcription of equation (37) yields 



In studying paramagnetic susceptibility we found that as long as the 
elementary magnets do not influence each other — and hence the Curie 
law rather than the Weiss law is valid — l is a function only of H/T. 
Reciprocally, H/T is then a function of l alone and so the derivative 
on the right side of the above equation vanishes. Thus the internal 


as the internal energy of an ideal gas is independent of the volume. 
We conclude that a material obeying the Curie law corresponds to an 
ideal gas. As in the case of gases, the departures from the law, which 
manifest themselves in an internal field, become increasingly important 
at low temperatures. For an ideal magnetic material, the transcription 


of (34) becomes 



(41) 


The second term in brackets represents the magnetic contribution to 
the entropy. The specific heat is to be substituted for the entire first 
term; and since we deal with solids or liquids no distinction need be 
made between c„ and c„. Note that, according to the above, there is 
ro influence of the magnetization on the internal energy and hence on 
the specific heat. 

In order to evaluate the magnetic share of the entropy we start 
with the fact that the magnetization l of a paramagnetic body is a 
function of fiH/kT, where fj. is the magnetic moment of the atom in 
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question. If this fraction is introduced as a variable x, the magnetic 
contribution to the entropy becomes, by integration of the second part 
of (41), 

~ ~ ly^) -/ /(*)*] • (42) 


Putting in the function fiL tanhx which holds in very many cases and 
noting that kL — R, we have 


» = — R(x tanh* — log cosh*) — Of 


• (43) 


In an adiabatic process the total entropy is constant, so that 

+ const. ... (44) 


On account of the dominance of the first term th& function O is nega- 
tive. If the field is switched on at a given temperature T the negative 
term appears and so the first term must increase in order that the 
entropy remain constant. This means that the temperature must 
increase to a value T -f- AT given by 

* T+AT 

f, T dT== ~®(kf) (45) 

For the small term <J>{jj.H/kT) we may use merely the initial temper- 
ature T. Assuming an average value c of the specific heat over the 
interval AT, (45) yields 



Conversely, on suddenly removing the field there must occur a drop 
in temperature whose magnitude is given by this same equation; and 
in the neighbourhood of absolute zero, where other adiabatio processes 
fail, this temperature drop may be of considerable importance, granted 
only that l depends solely on HIT (see above). By this means, temper- 
atures of only a few hundredths of a degree absolute have been attained. 
Since other methods will not work, the measurement of these temper- 
atures is accomplished by means of susceptibility measurements. Here 
the problem arises of computing the true temperatures from readings 
on the Curie scale, the two differing because of the appearance of ferro- 
magnetism. The difficulty has been overcome for a substance Buch as 
ferric ammonium alum. 

Ex. 127. Compute the difference of the specific heats at constant field and 
constant magnetization in a region where l = CHjT. 



CHAPTER XXXII 


Application op the Second Law to the Calculation op 
the Equilibrium op Thermodynamic Systems 

1. Gibbs’ Phase Rule: a General Theorem on the Maximum Number of 
possible Phases. 

Consider a perfectly general system containing a independent 
components (substances) in /J phases. We shall find presently that /3 
has a maximum value, determined by the number of independent com- 
ponents. We indicate the components by subscripts and the phases 
by superscripts attached to the letters. Thus the ma£s of the tth 
component in the &th phase is represented by M t h . 

The equilibrium is considered when the system is at a given tem- 
perature and a given pressure; thus We consider it an isothermal- 
isobaric system rather than an isolated one. In this case the Gibbs 
Potential 

G—V — TS + pV (1) 

is a minimum, according to p. 525. Of the quantities entering here, 
T and p are entirely independent of the mass of the system. On the 
other hand, U, V and S are proportional to the mass, for given tem- 
perature and pressure. From this, one sees that G is also propor- 
tional to the mass; thus if all masses are doubled, G becomes twice 
as great. In general 

G(qM x , qM» . . . gM u ) . . MJ. . ,2) 

But this is the definition of a homogeneous function of first degree 
in the masses. For homogeneous functions of nth degree the Euler 
theorem 

m+**m+ ■■■*•!&.- M > ■ ■ ■ * J < 3 > 

is valid. In our case this is 

">! + + «>■■■ M ->- <‘> 

To show this, it is necessary only to differentiate equation (2) with 
681 
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respect to q and set 9=1. The right member is a homogeneous func- 
tion of first degree in the masses. In order that the left member be 
such a function also, the derivatives dG/dMt must be homogeneous 
functions of degree zero, i.e. the derivatives may be functions only of the 
ratios of the masses. 

After these preliminary remarks concerning the properties of the 
function G, we return to the question under consideration. The quan- 
tities V, V and S are additive, e.g. the internal energy of a system is 
equal to the sum of the energies of its parts. Hence if we distinguish 
the Gibbs Potentials of the several phases by accents, we have for the 
entire system 

<?=£' + <?"+... G». ..... (5) 

The 6r* have, of course, the properties deduced above. Besides, they 
depend also on T and p. The condition of equilibrium is 

SG = SG' + 8G" + . . . SG* = 0, . . . . (6) 

with the supplementary conditions that ST — 0 and 8p = 0. In 
addition to this, the mass of each component must remain unaltered; 
the distribution of a given substance among the various phases may 
change, but the total mass of the substance remains constant. We 
then have the following a auxiliary conditions also: 


+ SMf + . . . 8 Mf = 0, 
SM 2 ' + 8M 2 " + ... 8 Mf = 0, 


( 7 ) 


8M a ' + SMf + ... SMf = 0. 


Taking the variation of G while T and p are held constant, we obtain 
the equation 


dG' , SG' , 3G'_ 8m: 




SG" 

h aW' 


SM 2 "+. 

sgp 


+ '" + dMf 


} 8 M/+... 


m." 


(8) 


Since the variations are not arbitrary, but are subject to the 
conditions (7), we apply the method of Lagrangian Multipliers 
(cf. p. 115), multiply (7) by A x , A a , . . and add them to the principal 

* The partial derivatives occurring here are sometimes referred to collectively as 
the Gibbs Potentials. Gibbs pointed out that they furnish a quantitative measure of 
the concept of chemical affinity. See p. 5541 
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equation (8). Since the coefficients of the 8M t te must vanish separately, 
we obtain the system of equations 


m? + Xi=0> 

^+A,= 0 ,.. 




m? + a 2=°,- 

■■mr + A “ =0 ’ 

• • 0) 


ms +x * =0 ’ 

m}+ A ‘ = o- i 



These equations, aj3 in number, must be satisfied. How many variables 
are at our disposal? First there are the a multipliers — then the deri- 
vatives depend on T, p and the (a — 1) mass ratios, i.e. on (o + 1) 
variables. Inasmuch as T and p are the same for every phase, we have 
in all <z + /? (a — 1) + 2 free variables. Since the number of equations 
can at most equal the number of variables, 

^ a + 0(a — 1) + 2 
or 

0^a+2 (10) 

The number of phases may he, at most, two greater than the number of 
independent components. When /? is equal to a + 2, T and p are com- 
pletely determined, since solution of the system of equations yields 
definite numerical values for these quantities. Thus a system consist- 
ing of a single substance can have three phases at most. For example, 
at a definite temperature and pressure water, water vapour, and ice 
can co-exist in thermodynamic equilibrium, but a second form of ice can- 
not be included. In the latter instance there could be no equilibrium; 
changes would take place until one phase would be entirely consumed. 

If there are but two phases we still have one variable at our dis- 
posal. In a system consisting of water and water vapour the tempera- 
ture, for example, is at our disposal, but the pressure is uniquely 
determined by the temperature and is not a second independent 
variable as in the case of a single phase system — say a gas. In general, 
the difference between the highest possible number of phases and the 
number of phases actually present is the number of degrees of freedom, 
f. Thus 

/— a + 2 — (} (11) 

As an example of a system consisting of two substances, consider a 
solution of sal ammoniac in water. Assume a very dilute solution with 
a second phase consisting of the water vapour above the solution. 
If the solution is cooled, ice begins to form at a temperature a little 
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below 0° C. We now have a third phase: ice. According to the phase 
rule there is still one variable free; take this to be the temperature. 
The concentration is determined by this variable, for if we continue 
to cool the system, more ice is formed, the solution becomes more 
concentrated and so its freezing-point sinks. The last fact is one of 
experience; a thermodynamic derivation of the result will be given 
on p. 646. Finally, the solution becomes saturated and the solid salt 
also precipitates from solution. The temperature at which tins occurs 
is called the eutectic point ; this is the lowest temperature the syBtem 
can attain. At this quadruple point the vapour, solution, ice and salt 
are all in equilibrium. If, on the other hand,' we start with a highly 
concentrated solution, the solid salt comes out of solution as the third 
phase when the ^temperature is lowered. Additional cooling causes 
the concentration to decrease further. Thus the curve of concen- 
tration versus temperature is also the solubility curve. If the eutectic 
point is reached from this side, there is again a simultaneous pre- 
cipitation of ice and salt. A solution having the concentration corre- 
sponding to the eutectic point has the lowest possible freezing-point. 

2. The Vapour Pressure Curve and the Melting-point Curve. 

The method of solving thermodynamic problems which was used in 
the preceding section consists in seeking the extremal value of the 
function S or F or G , whichever is pertinent to the situation at 
hand. The procedure demands 
little thought but does require 
some computational work. It is, 
however, possible to proceed 
otherwise by considering a cyclic 
process for each special case, and 
applying to it the two thermo- 
dynamic laws. We shall compare 
the two methods in deducing the 
vapour pressure curve. Let us 
formulate the problem in this way: 
Consider a quantity of liquid 
in a closed vessel together with 
F«*- ? its vapour. The vapour pressure 

is measured by means of a mano- 
meter. Since we are dealing with a two-phase system involving only 
one substance, there will be but a single degree of freedom, which we 
take to be the temperature. The volume plays no part, for if we de- 
crease the volume the pressure does not increase— -as it would if a gas 
alone were present. Instead, some of the vapour eondefises. Using the 
device of a cyclic process for determining the relation of p and T, let 
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us consider tlie cycle represented by fig. 1. At a temperature T + dT 
we vaporize one mol of liquid, thus moving a piston forward against 
the pressure p + dp. Next, we relieve the pressure somewhat by de- 
creasing the application of heat and arrive at the temperature T. At 
this temperature we reduce the volume until the vapour has again just 
condensed, after which we apply a small amount of heat to bring the 
substance back to the temperature T + dT. Neglecting all second- 
order quantities, as well as the minute quantities of heat involved in 
passing between the two neighbouring temperatures, the work obtained 
turns out to be dp{v x — v a ), where the subscript 1 refers to the gas and 
2 to the liquid. In the evaporation process an amount of heat equal 
to the heat of vaporization q T + a was co mmuni cated. By the 
Second Law, a fraction dT/T of this was converted into work, bo that 

M v i ~ v z) = ox = ( Vl -v a )T * (12) 


This Glausius-Clapeyron differential equation of the vapour pressure 
curve is rigorously true, but before performing the integration we make 
some simplifying assumptions. For one thing, we neglect the volume 
v 2 of the liquid in comparison with that of the vapour; also, we assume 
that the equation of state of an ideal gas applies to the vapour; finally, 
we work with constant specific heats. First consider the liquid to 
evaporate at T — 0 and p — p, and the resulting vapour to be warmed 
to T°. Alternatively, let the liquid first be warmed from 0° to T° and 
then vaporized. We have 

go + o v T = q T + off or q T = q 0 -f (c„ — c a )T, . (13) 


where q 0 is the heat of vaporization at T = 0. Then 
dp _ [go + (Qp ~ Ci)T] v 

dT RT* ‘ ’ 


Integrating, 


logp = — || i 4^-~?log2’-fiog J 4; . . 


p = 


(14) 

(15) 

(16) 


Since the influence of the last factor is small, the equation shows that 
the vapour pressure rises rapidly — almost exponentially — with the 
temperature. 

Let us now deduce the vapour pressure curve by the other method 
as well. In order not to repeat exactly what was done, let us take 
account here of the temperature dependence of the specific heats. At 
equilibrium the temperature is, of course, the same throughout the 
liquid-vapour system. Inasmuch as we seek the equilibrium position 
for a given value of T and of p, we must find the minimum value of 
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G. If there are n, mols of vapour and n 2 mols of liquid, the total free 
enthalpy is . 

G = n$ x + n^, (17) 

where the refer to one mol. The variation is to be taken under the 
conditions 

T — const., p — const... % + w g = const. . . (18) 

Since the g t depend only upon T and p, they must remain constant, 
on account of the auxiliary conditions. We have thus 

0 = Sntfx + Sn^ (19) 

We also have 0 — -f- Sna, (20) 

The method of undetermined multipliers yields 
9i + A = 0, g % + A = 0. 

or 9i ~ 9z = «i ~ «2 + P( v l — v z) + T(s 2 — s x ) = 0. (21) 

We must now try to compute the separate terms of this equation. 


For this purpose we treat the vapour as an ideal gas. In the neigh- 
bourhood of the point of condensation this may involve considerable 
error. We also neglect the volume of the liquid in comparison with 
that of the vapour in calculating the factor (v x — v 2 ). Under these 
conditions, by (13), p. 604, 

* * **1 == “i.o + f c vi dT, 

J o 

We have, in addition, 

pv x = RT — J T R dT (23) 

The expression (% i0 — u 2i0 ) represents the value at absolute zero 
of the difference in energy between vapour and liquid, i.e. the energy 
which must be supplied to the liquid to overcome the internal forces 
of attraction. This is then the value at absolute zero of the work 
of vaporization (or since it is usually given in calories, the heal of vapori- 
zation). Using equation (23) and Mayer’s equation [(14) p. 605], there 
results T 

— v 2 + PK — v z) = ?o + f c„dT — f c 2 dT. (24) 

*'o J o 

* For the liquid we start also with the relationship (dujdT) r - e„; in the oase of 

a liquid or a solid, however, the external work may be neglected, so that the two 
specific heats may be considered equal. 
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To calculate the expression Ts we turn to the differential equation 


for the entropy: 


ds = ^(du+pdv). 


For the liquid we can again neglect the work done in expansion, just 
as we need not distinguish between the two specific heats for liquids 
and solids. Then’ we have , 

, al 

*2 = 02 y - 


Integrating between the limits 0 and T, 

s 2 =£^ + s t ' 0 ( 25 ) 

The quantity s 2 0 — the entropy of the liquid at the absolute zero — is 
undetermined in the classical thermodynamics. We shall find later 
that for a homogeneous liquid this quantity must be zero. Expressing 
p in terms of v in the term pdv, we have for the vapour 
, c„dT . Rdv 

Indefinite integration gives 
=J C -^ + Rlogv 

=J^^ + RlogT + R\ogR-Rlogp + const. . (26) 

Now take the absolute zero as lower limit, and combine all terms not 
dependent upon T and p into a single constant s h 0 ; this represents .the 
entropy of the gas at T — 0 and p — 1. Since 

/ dT 

and c„ — c„ = R, 

the entropy at temperature T is given by 

8 i == J^ C -^ — RI °SP + s i-o- •••**• ( 26 ') 


Now the specific heat of an ideal gas does not vanish at T — 0, and 
so questions of convergence arise in connexion with the integral. In 
order to clarify the situation, consider the specific heat c„ to be made 
up of a part c„ e which is independent of the temperature and another 
part c,,,. which, according to experience, approaches zero rapidly as T 
decreases. This latter part is absent altogether for a monatomic gas. 
We thus take Cp = Cjlc + U w (27) 
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For p — 1, we have for a very small lower limit, 

s AT, 1) - sAT 0 , 1) =f * + c M log T - c, c log T 0 . (28) 

Because of the rapid decline in the value of c*,, we have written the 
lower limit of the integral as zero. Since-the contribution of c„ to the 
integral is negligible even at T 0 — 1° absolute, we may interpret s li0 
approximately as the entropy at T — 1 and p — 1. Strictly, it is the 
limiting value of the difference s x {T 0 , 1) — log T 0 as T Q approaches 
zero.* 

Replace also o„ by c pc + and substitute (24), (25) and (26') in 
equation (21). Solving for log p, we obtain the vapour pressure formula - 

- -m+ mfy T - + U' 5 # 




(29) 


If we were to assume constant specific heats we would again obtain 
equation (15), p. 535, except that the constant of integration would be 
written differently. Of course, differentiation would again yield the 
Clausius-Clapeyron equation. 

The constant (s hQ — s ti0 — c^jR appearing in (23) is called the 
" chemical constant ” of lie gas, since it occurs also when considering 
chemical equilibrium. It is not determinable by the two thermo- 
dynamio laws, but may be evaluated theoretically by calculations 
involving quantum statistics. This constant may be determined 
experimentally by means of the observed vapour pressures, however, 
since all remaining quantities in equation (29) are sufficiently known. 

It is customary here to measure the pressure in atmospheres, to 
replace logarithms to base e by ordinary logarithms and to measure 
all quantities in thermal units (R = 1-985 cal). Hie result is 

loglW = — lo § T + f57fy f 0 ~ c vr) dT 


1 j T {Cyr — Cz)dT _ 


(30) 


~ 4-571 J 0 T 

As we shall find later, s 2 0 vanishes, so that 

‘ («i,o ~ o w ) - log (1-013 X 10«). 

For a monatomic gas of atomic weight A the quantum statistics 
(p. 638) gives the -value _ 

» = — 1 - 59 + | log A (31) 

nit is a consequence of the empirically verified 
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AD lie equations derived above are valid also for the transition 
from the solid state (subscript 3) to the gaseous state. We need only 
replace the subscript 2 by 3 throughout; q is then the heat of sublima- 
tion. On the other hand, the integrated formulae contain an approxi- 
mation which prevents their application to the transition between the 
solid and liquid states, for here the two volumes v 2 and v s are approxi- 
mately equal, so that it is no longer allowable, as it was in the case 
of a gas, to neglect one as compared with the other. The Claushls- 
Clapeyron equation, however, is exact and may be applied also to 
the phenomenon of melting. Inverting the formula, it gives the 
variation of the melting-point with pressure : 


dT T(« a — 1> 8 ) 
dp q r 


. (12') 


The quantity q r represents the energy which must be supplied to the 
system to bring it from the state of higher subscript to the lower — 
in our case the heat of fusion. Since this is always positive, the sign 
of dT I dp depends only on that of v 2 — v 3 . If, as is usually the case, 
the solid is denser than the liquid, v a > t> 3 , and the melting-point rises 
with increasing pressure. In the opposite case, e.g. for water, the 
melting-point falls as the pressure increases. 


3. Chemical Equilibrium in a Mixture of Ideal Cases. The Law of Mass 
Action. 

We now consider a system consisting of only one phase — the gaseous 
— but of several substances, aU of which are ideal gases. The gases 
will react with each other chemically to a certain extent, but a chemi- 
cal reaction is never complete in the sense of the equation. For 
example, if hydrogen and oxygen are allowed to combine to form 
water, there remains always a certain amount of the original substances 
H 2 and O t . The fraction of the original material remaining increases 
with the temperature. We set ourselves the task of determining the 
distribution in the state of equilibrium as a function of the pressure 
and temperature. Let the volume of the system be V and let the 
number of mols of the tth gas be n t . Represent the possible reaction 
by the equation 

v i-di + »v4 2 + . . . v„A m ^ v m+1 A m+ i + v m+% A m+i o- (32) 

In order to avoid mistakes in sign, we agree to write the equation in 
exothermal form, in which the heat of reaction is positive. In altering 
the numbers of mols, it must be remembered that they must satisfy 
the chemical equation, i.e. they must be in the ratio of the number of 
mols formed or removed. On account of the sign, we specify that the 
numbers of mols present before the exothermal reaction (i.e. those 
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on the left) are positive — those on the right, negative. Hence * 

8% : Si ig : . . . Sn m : S n m+1 : . . . 

— + v i • + : . . . — v m+ i : — v m+2 : . . . (33) 

Since we investigate the equilibrium at a given temperature and 
pressure, the variation of the function G — U + pV — TS must again 
vanish. The several terms may be computed as follows: The energy 
of the mixture is, naturally, equal to the sum of the energies of the 
components: 

XJ =s jjjWj -f n 2 u 2 -ft. • • == . . . (34) 

For the volume we have, by p. 496, 

V = Em, — (35) 

V 

The calcrdation of the entropy of the mixture as a function of T and 
p is somewhat more difficult. It is true that we know the entropy of 
a single ideal gas (§ 2, p. 537). Moreover, if we have two gases side 
by side, both at the same pressure, their entropy is equal to the sum 
of the individual entropies, but if the gases are allowed to mix, the 
very process of jnixing is a self-maintaining one and involves an 
increase of entropy; this process is simply the expansion of both gases 
to the total volume of the vessel, whereby the partial pressure drops 
but the total pressure remains constant. It would seem more plausible 
that the entropy of the mixture should be equal to the sum of the 
entropies of the separate gases when each one occupies the volume of 
the mixture at a pressure equal to its partial pressure, for the process 
described on p. 523 permits the two gases to mix without change of 
entropy in such way that the partial pressure remains the same after 
mixing. 

In* order to simplify the calculation, we assume the specific heats 
to be constant and obtain, as in equation (26') (p. 537 ), 

*< = log T — R log Pi 4- s w , .... (36) 

where s i0 embraces all constant terms, including the term appearing 
for constant specific heat, c„ log T 0 , as T 0 -> 0! 

Expressing the partial pressure in terms of the total pressure, we 
have, by p. 496, 

Si — c vi log T — R log n t — Rlogp + R log En ( + s m 


— s { (T, p) — R log 4~ 72 log (37) 

and S = ZniSi(T,.p) — RZn { log n { -f .REw* log Ew ( . . (38) 


* With this agreement as to sign, the changes 8 n ( signify the numbers of mob 
formed in the endothermal reaction. 
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The entropy of the ith gas at a temperature T and pressure p is 
denoted by s ( {T, p). Multiplying (35) by p, adding it to equation (34) 
(p. 540) and subtracting (38) multiplied by T, and making use of the 
fact that RT = pv f , we obtain the Gibbs Potential of the system: 

0 — 'Sn i g i (T, p) + RTT,n t log n t — jRTEw, log Ew<, (39) 

where the g t (T, p) denote the Gibbs Potentials of the separate gases at 
temperature T and pressure p. In order to determine the equilibrium 
it is necessary to take the variation of G with the auxiliary conditions 
(33) (p. 540), as well as with 8T = 8p — 0. Since the g t depend only 
upon T and p their variations vanish and we obtain 
8G = 0 = £ 8n i g i -J- RTTi Sn^ log n t -j- RTTi 8fif 

—RTT, 8n t log £«* — RTT, 8 n t , .... (40) 

or, on account of (33), 

S^log^-Sv.log 2n t = T) , ... . (41) 

n { 

or since p { = p 9 

Hv { logp f = — ^ + Sv,. logp = log K p {p, T ). 

It remains only to calculate the right-hand member. By equations 
(22) and (23) (p. 536) and equation (36) we have, for c v constant, 
—"Lvfg^p, T) -J- j RT’E vf logp == — X^m <0 — Hv t o vi T — hvfRT 

+ T log T"Lvfi vi — RT logpT,v ( 4- TSvjS, 0 + RT log pTLv t 
= — — TTLvfij* + T log TI,ufi vi + T£ . (43) 

According to our convention regarding the sign, T,v t u i0 represents 
the difference in energy at T = 0 of the system before and after the 
reaction (written in exothermal form), i.e. the heat of reaction q~ 0 at 
the absolute zero. After dividing by RT we note the pressence of the 
combination (s <0 — o pi )/R, the “ chemical constant ” i,- of the indi- 
vidual gases which we already met in connexion with the vapour 
pressure curve. Hence, by analogy with equation (30) (p. 538), 

Sv, logp <(atm .) = — + £ v t i t , (44) 

or using a constant B which depends only upon the i t w r e have, similar 
to (16) (p. 535),' * 

* Formulae (30) and (44) are intended for practical use, while (16) and (44S$ give a 
better indication of the theoretical basis. 
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Tlp/i = -- - = BirrJRT T %Hc si !R = K ( p T). (44') 

* , »+r»+! 


Equation (44') expresses the celebrated Law of Mass Action, deduced 
by Guldberg and Waage from kinetic considerations. These investi- 
gators, however, were not able to give an expression for the mass 
action “ constant ” K v , which really depends on T and p, and so is a 
constant only at a given temperature and pressure. The name “ Law 
of Mass Action ” comes from the fact that a change in the mass (and 
hence the partial pressure) of one of the factors in the reaction imme- 
diately causes a change in the partial -pressures of the others. If, for 
example, we have an equilibrium between an exothermal compound 
1-2, (subscript 3), with its two constituents 1 and 2 , and if T and p 
remain constant, the addition of one of the constituents immediately 
causes the partial pressure of the compound to increase, since the 
fraction PiPjPz remains constant. This means that the addition of 
one of the products of dissociation lowers the degree of dissociation.* 
For many purposes it is more useful to write the Law of Mass 
action not in terms of partial pressures but in terms of relative con- 
centrations c t (mol fractions) which are defined by 




En/ 


(45) 


Using Pi — o t p and v — there follows 

lie,** = -- &?& ■ *••• .. = Be-°~' IRT p~* = E a (p, T). (46) 

c ¥mil c Vm+i . . . 

»+l «+2 


For reactions which involve no change in the number of mols, v be- 
comes zero and hence K c — K„ and the equilibrium does net depend 
on the pressure. If one compares equation (44') with the vapour 
pressure formula (16) (p. 535) deduced under the same simplifying 
assumptions (constancy of specific heats), the similarity becomes 
evident. This is not fortuitous. In the following sections we shall 
derive a generalized law of mass action for polyphase systems, in 
which category the simple process of evaporation of a single substance 
is also included. As in the case of the vapour pressure formula, we 
can sat up differential relationships fol the law of mass action which 
are free of approximations. Using the mol fractions c f , (41) becomes 

E v, loge t = — E v t gi — 4- \ogK e (p, T). . (47) 


* This result, as well as many others, both physical and chemical, comes within 
the scope of a general law given by Le Chfttelier: If some stress (e.g. a change of pres- 
sure, temperature, concentration, tea.) is brought to bear on a system in equilibrium 
the equilibrium is displaced in the direction which tends to undo the effect of the stress. 



XXXII.] THE SECOND LAW AND EQUILIBRIUM 543 

Since P — — s ( (of. p. 527), 

(pvr)r (48) 

But the expression S ^(m* + P°t) is the heat of reaction at constant 
pressure, q~ p , by equation (38) (p. 511). Hence 



In the same way, if we denote by v the change in volume occurring in 
the reaction (which is determined by holding the pressure constant), 
i.e. £»w<, then we have, since (cg;dp) T — v. 



4. Chemical Equilibrium in a System consisting of Dilute Solutions and 
Ideal Gases. 

Those multiple phase systems which contain, besides the gaseous 
phase, dilute solutions, i.e. phases in which one substance (subscript 0) 
— the solvent — is present in far larger quantity than any of the others, 
lend themselves to detailed investigation. We consider a solution to be 
dilute in the thermodynamical sense if further addition of solvents 
causes no decrease in volume and if no heat exchange (heat of dilution) 
is detectable with further dilution. The more nearly these conditions 
are fulfilled, the better our conclusions will agree with experience. 
We again examine the equilibrium at a'given temperature and pres- 
sure and must therefore find the extreme value of the function G. 
The separate terms may be calculated as follows: Denoting the sol- 
vent by the subscript 0, the internal energy is given by 

U — U(p, T, n 0 , (50) 

On account of the smallness of the mol numbers n x , n 2 , . . . we can 
develop this function in a Taylor’s series and obtain 

+ (51) 

The partial derivatives depend only on the number of moLs of the 
solvent, since the derivatives are taken at n v n^, ... — 0. Write u { 
for these derivatives: 

V — n 0 Ug 4 - + « 2 «a+ (52) 

In keeping with our original assumptions we neglect the mutual energy 
of the molecules of the solutes; this corresponds to breaking off the 
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series after the linear terms. According to Coulomb’s Law, the mutual 
energy of charged particles (e.g. the ions in electrolytes) is large on 
account of their relatively great charge, even when the particles are 
far apart. Hence our results will not be applicable to electrolytes. 

In the same way We make the analogous abbreviation v t — dVjdn t , 
and have 

V = n 0 v 0 -)- + w 2 u s + • • • • „ • . . (53) 


According to our convention, u t and v t represent the partial energies 
and partial volumes respectively of the dissolved substances per mol. 

If we were to develop the entropy function in the same way, we 
would have „ 

S = w^o -f- + n 2 s 2 + . . . , . . . (54) 


which would represent the entropy of a system consisting of n 0 mols 
of solvent side by side with n t mols of the solutes, but the mixture of 
the substances would not be represented. Hence there must be, in 
addition, a funotion of the mol numbers which does not depend on T 
or p. This function is determined if we know its form for any one state 
of the system. According to p. 541, the value of this function is 
— BSn,- logo, for the gaseous state. This value must be retained for 
all temperatures and pressures. Thus, to a first approximation, we 
have the same expression as that derived for gases in § 3 (p. 641): 

G^Zn i g i (T,p) + RTZn i logc i . . . . (55) 


If the solvent does not take part in the reaction we set v 0 = 0, and 
the chemical equilibrium in a dilute solution is governed by 

n e?*=*K,(p,T) (56) 

The dependence of the equilibrium constant K e on temperature is 
again given by equation (48') (p. 543). There is no variation with ■ 
pressure, since no change of volume occurs. The most important 
reaction which takes place in the solution is that of electrolytic dis- 
sociation, but in this instance the inter-ionic forces cannot be 
neglected. This is why strong electrolytes, i.e. those which are highly 
dissociated, do not obey the law of mass action. In such solutions the 
interactions give the impression that the dissociation is incomplete, 
while actually it is practically complete at great dilution. On the 
other hand, weak electrolytes — especially organic ones — conform 
very well to the mass action law. 

We can now investigate the equilibrium of multiple phase systems, 
also, provided they coqpist of ideal gases or dilute solutions. As a 
special case, of course, individual phases may consist each of a single 
substance. We must rule out the presence of liquid and solid phases, . 
which comprise several substances in approximately equal concen- 
tration, since it is then no longer allowable to terminate the Taylor 
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expansion with first order terms. We again distinguish the phases 
by dashes and superscripts, and have for the G function of the entire 
system 

G — 2 n/g/ip, T ) + 'Ln”g i "(p, T) -f. . . . 

+ RT (2 n l log c/ -(- 2 n" log e" + . . .) 

= 2*2,» < <*>0/»(p, r)+2jr22n/»logc/«. . . . (57) 

The determination of the extremal value of G under the usual auxiliary 
conditions leads, exactly as above, to the equation of the generalized 
law of mass action, which also includes transformations between 
individual phases: 

2 t 2 < v 1 (*> logc/*)= - ~ 2*2 iV Wg^{p,T) = log K e (p, T). (58) 


The same equations are valid for the derivatives of the function logZ, 
as in the case of the special mass action rule for single-phase systems. 

We now consider a two-phase system whose liquid phase is a dilute 
solution and whose solid phase is the pure solvent. Characterizing the 
solvent by the subscript 0, we have, for the liquid phase, 


»o' + V’ 


n o + n i 


and for the solid phase 

Ci" — 0, e 0 " = 1 (59') 


The reaction in question is, in the exothermal formulation, the passage 
of one mol of solvent from the solution into the solid phase, whereby 
the heat of fusion is liberated as the “ heat of reaction ” q~. Hence 
the condition of equilibrium is 

log(l — <h') — logl = logtf c (p, T). . . . (60) 


At a given pressure, e.g. atmospheric pressure, this is a relation be- 
tween the equilibrium temperature T, i.e. the melting-point, and the 
concentration of the solution. For the pure solvent alone, — 0, and 
the right member becomes zero, since only log 1 occurs on the left. 
If, at constant pressure, log K e {p, T) is developed in a Taylor’s series 
which we terminate with the first term, and if we replace log (1 — c*') 
by the approximate value — c/ we obtain, by equation (48') (p. 543), 


-^={T~T 0 ) 


' 9 logffA 
, dT ) t 


= (T-T 0 ) 


RT o* 


or 


T-T 0 


RT* 

n 0 + < <Cv 


% RTl 

< r * ' 


. . (61) 


Since the conversion of one mol of liquid solvent to -the solid state 

*« (* 711 ) 
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liberates heat, q~ v is positive, so that the freezing-point is lowered. 
The equation shows the depression of the freezing-point to be propor- 
tional to the ratio of the number of mols of dissolved substance to the 
number of mols of solvent., This iB Raoult’s Law. The factor of pro- 
portionality is determined by the melting-point T 0 of the pure solvent 
and the heat of fusion q v . If several substances are dissolved, we have 
the sum S n f /n 0 in place of %/n 0 , as may be verified at once. If, for 
example, we dissolve a substance which breaks up into two con- 
stituents, say a binary Z-Z- valent electrolyte, the depression of the 
freezing-point must be twice as great as for the same number of mols 
of a substance which does not decompose. In reality the double value 
is attained only when the dilution is very great, from which we already 
concluded that equilibrium between molecules and ions exists, especi- 
ally since the degree of dissociation as determined by the lowering 
of the freezing-point agrees well with the values obtained from electro- 
lytic conduction. Actually the dissociation of strong electrolytes is 
much greater than that to be expected on this ground; the deviations 
from the value corresponding to complete dissociation are for the most 
part attributable to the electrical forces operating between the particles 
(cf. the following section). 

For substances which do not break up (non-ionogens) we obtain 
the molecular weight from the lowering of the freezing-point. It is 
readily calculated from (61) that one molecular weight of such a sub- 
stance dissolved in one litre of water causes the freezing-point to drop 
to -1-86° C. 

The 3ame calculations hold also for the donation of the boiling-point, 
the pressure remaining constant. The sign of T — T 0 is reversed, 
since the solution is now the phase of lesser energy. The calculation 
shows that one mol of a non-ionogen dissolved in one litre of water 
raises the boiling-point to 100*52° C. 

Finally, we wish to derive the formula for the osmotic pressure of 
a solution. Consider a system composed of two liquid phases, the 
solution and the solvent. They are prevented from mixing by a 
separating wall. Let this wall be selectively permeable to the solvent. 
The wall may be considered non-existent for a substance whose free 
passage it permits; hence there must be the same number of mols of 
this substance per cubic centimetre on both sides of the membrane, 
as in the case of a gas. But in the system under consideration this is 
certainly not the case; for, besides the solvent, the dissolved sub- 
stance is also present on the solution side, so that the solvent will 
continue to pass through the membrane until there is no longer any 
difference in concentration on the two sides. Thus with a solution of 
finite concentration and equal piossure on the two sides, equilibrium 
cannot exist. However, by placing the solution side under higher 
pressure, the passage of solvent can be prevented. 
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The two phases are thus under different pressures. A system of 
this kind is not included in our previous considerations, and the con- 
ditions for equilibrium must be derived anew. We wish to examine 
the equilibrium at a given temperature, a given pressure and a given 
pressure difference, which we call the osmotic pressure. Hence a virtual 
displacement of equilibrium is free to cause only a change in the 
number of mols in each phase, and it is evident that the condition of 
equilibrium 80 = 0 again leads to the equation 

2 2, j Mog c / = _- 1 -2S, < V. 

The difference is that the functions g* refer to different pressures. 
In the case at hand, the reaction consists of the transfer of one mol 
of solvent to the solution side. To distinguish the solution from the 
solvent, the former is denoted by a prime. Then 
1 log c 0 ' — 1 log I — log(l — of) ss — Ci' 

= ^ < 7 0 (p, T) - ±,g 0 ’(p -f jw T). . (62) 


Developing the right member in a Taylor’s series which' we terminate 
with the first term, and using the relation (31) (p. 527), we obtain 


or 


< RT\dp/ r Imm 


_ n l , RT r 


nfRT 
V • 


(63) 


Thus the osmotic pressure is equal to the pressure which the same number 
of mots of the substance would exert if they were in the form of a gas 
occupying the volume of the solution at the same temperature. This was 
first shown by van’t Hoff. The law will be deduced from another point 
of view on p. 585. 


5. Thermodynamic Equilibrium of Dilute Solutions of Strong Elec- 
trolytes. 

As already mentioned, the older theory recognized the existence of 
an equilibrium between the molecules and the ions of strong electro- 
lytes. However, for strong electrolytes, i.e. those which are good 
conductors, the equilibrium does not satisfy the Law of Mass Action. 
The reason is that it is not permissible to neglect the inter-ionic eleo- 
trical forces in formulating the thermodynamic functions. The pre- 
viously employed assumption that the individual particles may be 
treated like the molecules of an ideal gas is evidently not justified 
here. We must add to the previously used internal energy t/ id an 
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electrical contribution which, by the method of Debye, may be com- 
puted without undue difficulty. 

In the case of electrical conduction the situation is more compli- 
cated, and so we refrained from going through the very cumbersome 
calculations when we treated this phenomenon on p. 423. 

The entropy of the system is also altered; as a result the function 
G, which determines all equilibria at a given temperature and pressure, 
also has an added term: 

G'=Q iA + G d = U + pV-TS. . . . (64) 

For condensed systems the term pV is not of consequence, and it 
is not necessary to distinguish between ( dGJdT)„ and (dG/dT) r ' t hence 
by equation (33) (p. 528) we have the following differential equation 
for G: , a 

°- v + T §- 

Since this equation is satisfied by G id , the added electrical term G** 
must also satisfy it; this serves to determine G el from U eh for it follows 
by integration of 


Tins shows that most of the work required to find the function G is 
done if the electrical energy Z7 d is known as a function of the tem- 
perature. This will now be determined for a binary Z-Z- valent electro- 
lyte. The extension of the calculation to more complicated cases is 
not difficult. Let us consider a single positive ion. As mentioned on 
p. 423, there will be — averaged in time — more negative ions in its 
neighbourhood than positive. The average distribution of charge may 
then be replaced by a continuous space charge determined as follows: 
According to Boltzmann’s Principle (cf. p. 588), the number of par- 
ticles possessing an energy u is proportional to 


* The integration is effected by starting with the homogeneous equation 


v vary the parameter, putting = T • v(T) and 


for v. The integral ii 
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where k is the gas constant per molecule, k = RfL. At a place where 
the potential is </i the charge Ze has the energy Zei/j, so that the number 
of positive ions in a volume element dV at that point is 

dN+=Ce~ Zt * lkT dV. ..... (67) 

The constant 0 is determined from the fact that uniform distribution 
must result for ifi = 0; C is thus the average number of positive ions 
per cubic centimetre, which we denote by N+. In the same way, the 
number of negative ions in the same volume element is 


dN_ = N_e +Ze * ikT dV. ..... (68) 

For a Z-Z-v alent electrolyte N+ — N_ — N. Hence the space charge 
density at a point where the potential is ^ is 

p=- 2iVZe8inh^j (69) 

Now p and </> are connected by Poisson’s equation (p. 278 et seq.): 

. . 47rp 877 -NZe . , Ze<p „ 

. . . (70) 


where D is the dielectric constant of the solvent. The value of the 
potential at a distance of one ionic radius from the point in question 
is such that we may develop the term &inh.(Zetp/kT) and omit the 
higher powers. In this way we obtain 


A</.= 


SttNZW 

DkT 




(71) 


where 


l&nNZ*e* 
V DkT * 


The quantity x, which has the dimensions of the reciprocal of a length, 
appears in all the subsequent computations. Its meaning beoomes dear if we 
digress for a moment from the problem in hand, viz. that of the distribution of 
charge round a central point, and consider the simpler linear case of a plate at 
the potential i^o dipping into a liquid eleotrolyte. The differential equation is then 

( 72 ) 

and its integral is * 

(73) 


Thus 1/x is the distance in whioh the potential — and hence, by Poisson’s equa- 
tion, the negative space oharge — has diminished to 1/e times its original value, 
i.e. 1/x is a measure of the thickness of the compensating layer. The smaller N is, 
i.e. the more dilute the solution, the larger is the quantity 1/x. 


* The letter e is used in two ways here, but there is no possibility of confusion, 
u the base of natural logarithms almost always carries an algebraic exponent. 
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For the spherically symmetric case which interests us here the 
differential equation becomes 

+ (74) 

As may be verified by substitution, the general solution is 


Since the potential must vanish at infinity B — 0. The constant A is 
determined by the fact that the potential in the immediate neighbour- 
hood of the selected ion must be that of the ion alone. The resulting 
value of A is then Ze/D, and the solution becomes 


y ~~D' t Dr Dr ' * '• * * * 

The second form shows how the potential is made up of one part 
attributable to the .central ion and another part due to the presence 
of the surrounding ions of opposite sign — the “ ion olond At r — 0, 
i.e. at the centre of the selected ion, the cloud generates the potential 

= - lim (1 — O = — — . . . (77) 


This is equal to the work required to bring a unit charge from infinity 
to the middle of the spherical cloud. Hence the potential energy 
possessed by the central positive ion is 

, V / ■ "7Q\ 


Similarly, for a negative ion we would have 
Z-eV 
M -~ D • 

Since each ion is counted twice over — once as the central ion, once as 
part of the ion cloud — the total energy is 

ri 1 2jVFZ 2 eV NVZW /SttNZ*<? 

y d== __ _ . (79) 

Substituting this in equation (66) (p. 548) and performing the integra- 
tion, 

( 80 , 
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or, for n, mols of solute, 


2 n 1 LZ t e t K 
~ 3 D ’ 


(80') 


where X is Loschmidt’s (Avogadro’s) Number. Then the Gibbs Poten- 
tial of the electrolytic solution at complete dissociation is 

n^-f 2n l g 1 +RTn 0 logc„-f 2fl.Tn 1 logo, — (81) 


, ISttNZW /8t / 87m, 

where K Sw = vw. w Vw • (82) 

The last term shows that k depends on the number of mols of the 
solvent. Once the characteristic function G is determined, all else is 
merely a matter of computation. 

We now calculate the difference between the lowering of the 
freezing-point in this case and that for complete dissociation without 
electrical interaction. The process consists of the conversion of one 
mol of solvent from the liquid state to the solid state. The only quan- 
tities which vary are the numbers of mols of the solvent in each phase; 
it is to be remembered, however, that these enter into the quantity k. 
We shall not yet replace the variations 8% in the equilibrium formula 
by the conversion numbers v ( jjhioh are proportional to them. Indi- 
cate the liquid phase by one dash attached to the letters and the solid 
phase by two dashes. Then we have, exactly as in § 4 (p. 545), 

K" lo gCo" + S V logo,,' 

+ ■ (83) 

Noticing that 

Smq" = — S?ip ; Cg' — 1; Cg — 1 2 o, ; logCp - 2 c, , (34) 


we obtain 


II, 

“ fij, (Ho “ Ho 


n^ZV 3k) 
D 3n 0 ' (* 


( 85 ) 


If, as in § 4, we develop the functions g 0 ' and g 0 " at the point T 0 , i.e 
Ghe freezing-point of the pure solvent, and if we remember that 




, 5 o - T . 


20,' = 


&Tq~ v n^LZW 3k 
" 1 DRT 0 3 n 0 ‘ 


( 86 ) 

(87) 
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If the electrical forces were not taken into account we would have for 
the lowering of the freezing-point, at complete dissociation. 


AT* 


2 c^RTf 
Tv ’ 


i.e. 


2ci' = - 


AT id 
RT 2 • 


. ( 88 ) 


Using this expression to compute the relative departure from the 
present value we obtain 


A T id — AT _ , AT 

AT^ ' ATu 


* DRTfPj' dn Q r 


It remains only to compute the value of 3/c/3n 0 '. From the approxi- 
mate expression 


’LZ 2 e? 


we have at once 


n^vJDkT 0 


and since n^fn^ = c/ and 22 = 

, AT _ , TZ 2 eV _ V2 w /< TW 

AT m * 22^ 3 V F 0 ' D 3i2 (RT 0 ) 3,r { ) 

We then have the following result: The relative departures from the 
value of the depression of the freezing-point for complete dissociation 
and negligible electrical interaction are directly proportional to the 
square root of the molar concentration inversely proportional 

to the 3/2 power of the dieleetrio constant and — for Z-Z-valent 
electrolytes— directly proportional to the cube of the valence. This 
relationship is well substantiated experimentally for very dilute 
solutions. 

A refinement of the calculation must take into account the finite 
ionic radius; this has been done in the Complete theory of Debye and 
Huckel. 



CHAPTER XXXIII 
The Nernst Heat Theorem 

1. Free Energy as a Measure of Chemical Affinity; Determination of 
this Quantity for the Galvanic Cell. 

In -the “ classical ” thermodynamics, which employs only the two 
fundamental laws discussed in previous chapters, the absolute value 
of the entropy remains undetermined; as a result, the values of the 
functions F and 0 are fixed only as far as a linear function of T. This 
indeterminacy becomes very inconvenient in certain applications. 
Thus we saw in the preceding chapter (p. 534) that it is not possible 
to calculate the absolute value of the logarithm of the vapour pres- 
sure from a knowledge of the specific heat and heat of vaporization— 
that we required, in addition, a knowledge of the “ chemical constant ”, 
which is closely connected with the entropy constant. But the latter 
is determined by the two laws of thermodynamics only to the extent 
that a value once given must be preserved in all changes of the system. 
On the other hand, the Nernst theorem leads to a definite statement 
regarding the entropy constant of solid and liquid substances. The 
calculation of the chemical constant of a gas can be performed only 
with the help of the Quantum Theory. It is through a knowledge of 
the entropy constant, or of the chemical constant, that the equilibrium 
formulae of the preceding chapter become really of practical use. Only 
when this is known can a chemical equilibrium be computed theoreti- 
cally. 

Nernst arrived at his "heat theorem through the problem of cal- 
culating chemical affinity. It was a considerable time before a practical 
measure of the somewhat anthropomorphic concept of the chemical 
affinity of two substances — or, as we say to-day, of a chemical re- 
action— was found. One of the foremost demands to be made of such 
a quantity would be that it tell in which direction the reaction pro- 
ceeds. Berthelot took the heat of reaction as a measure of affinity, 
and — as a matter of fact — at room temperature most reactions are 
accompanied by the evolution of heat. Nevertheless, there are some 
very common processes — like the dissolving of most salts in water — 
which are accompanied by cooling, and which are therefore endo- 
thermaL It was van’t Hoff who first recognized that for reactions at 
constant volume the free energy F is the proper measure of affinity, 
668 
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while for reactions proceeding at constant pressure it is the Gibbs 
Potential O. 131666 quantities are a minimum at equilibrium; the 
greater the values of these functions compared with the minimum 
values, the farther the system is from equilibrium — hence the more 
readily will the reaction occur. The free energy of a chemical reaction 
proceeding at constant pressure' is easily determined experimentally 
by using the reaction in a simple reversible galvanic cell and measuring 
the electromotive force of the cell. For condensed systems, i.e. those 
not containing the gaseous phase, it is not necessary to distinguish 
between isometric and isobario reactions and hence between F and O. 
Suppose the chemical change in this cell takes place by the migration 
of an ion bearing a charge of Z units. Then when one mol has been 
transported, a charge ZJ has passed. The Faraday equivalent charge 
is denoted by J here to avoid confusing it with the free energy. For 
a difference of potential <]>, the work done is given by ZJ<S>, and this 
is equal to the change in free energy: 

m-fx-u . a) 

Now /satisfies the differential equation 

so that 

• ■ P) 

and hence by (1), 

® = § + T (al); < a > 

This important relationship between the potential difference, the heat 
of reaction of the chemical process and the temperature coefficient of 
the potential difference, may be derived also from the equation 
TdS — dU 4- dW if the external work is expressed in terms of the 
electrical quantities (of. Ex. 125, p. 528). Since it is usual to express 
in volts and J in coulombs, the heat of reaction must be expressed 
in joiiles. 

By examination of the empirical data, it is found that the electro- 
motive force of most cells at room temperature is fairly accurately 
given by the simple formula 

obtained by setting the change in free energy equal to that of the 
total energy. TJhe lower the temperature at which the measurements 
are taken, the better the agreement is found to be. One would con- 
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elude from this that at absolute zero the differences,/^ — / 2 and u t — u~, 
are equal, i.e. that Berthelot’s Principle is strictly true at that tem- 
perature. By equation (2), this means that 

§£._3/i 

dT a T 


remains finite for T — 0, or else that it becomes infinite to so low an 
order that the limit 



vanishes. But since the agreement is still good at relatively high 
temperatures, Nernst made the hypothesis that the curves f x (T) — / 1 (2 T ) 
and u t {T) — iq(T) not only intersect at T = 0 but touch each other 
there. Corresponding to the way in which he arrived at his theorem, 
Nerast made a statement concerning only the difference of the 
functions / and u before and after the reaction, but Planck extended 
the hypothesis by assuming that it holds for and / 2 separately, 
i.e. that 

lim/(T) = lim u(T) 

As we shall see in the following section, this serves to determine the 
entropy constant of pure, condensed substances. 



2. Formulation of the Nernst Heat Theorem. 

We obtain a new form of the Nernst theorem by solving the dif- 
ferential equation of free energy (deduced on p. 527) for ( df/dT) v : 

m.-y <*> 

Since / becomes zero when T = 0, the expression df/dT assumes the 
indeterminate form 0/0 under these conditions. The usual means of 
evaluating the fraction is to differentiate numerator and denominator 
separately and then substitute T — 0. This yields 

< 6 > 

Now, by the Nemst-Planck relation, the entire right member is zero, so 
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that each term on. the right most be separately zero in this case. The 
character of the function is represented in fig. 1. 

According to equation (30), p. 526, 

($.--+ ■ 

from which it follows that the entropy 5 also vanishes at the absolute 



We may therefore state the Nemst theorem in this way: At the 
absolute zero of temperature the entropy of a pure solid or liquid has 
the value zero. This statement applies only to pure substances, 
since for mixtures there must be an additional term covering the 
increase of entropy which accompanies the process of mixing. We 
must also except gases, as long as we retain the classical gas equation, 
for this equation yields an infinite entropy constant at T = 0 if the 
value of S is to remain finite. In reality, it follows from the new 
statistics (p. 626 et seq.) that the equation of state of a gas is subject 
to a correction which preserves the general validity of the Nemst 
theorem. In particular, there can be no difference in the entropy cf 
two different modifications of a substance at the absolute zero. For 
instance, a paramagnetic material must have the same zero-point 
entropy whether it is magnetized or not; i.e. it must then behave 
like a permanent magnet even without the field. 

In a statistical interpretation, the Nemst theorem affirms that a 
body must be in its greatest state of order at the absolute zero. It is 
precisely here that the theorem has a limitation as far as its practical 
meaning is concerned. To be sure, the state of maximum order is that 
of equilibrium at T — 0, but as to the amount of time required for the 
attainment of that equilibrium, thermodynamics has nothing to say. 
Since the motions of ions and of molecules dwindle at low temperatures, 
states of non-equilibrium can persist indefinitely. For example, glasses 
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have a state of higher order— the crystalline — and yet a piece of glass 
is still glass at T = 0 ! Circumstances are otherwise in connexion with 
the motion of electrons and the orientation of the magnetic moments 
connected with them. Their motion is not hindered by any increase of 
internal friction with falling temperature. For this reason the conse- 
quences for magnetization are of especial importance. 

3. Consequences for the Specific. Heats and Temperature Coefficients 
The Chemical Constant of a Gas. 

It follows from 

. ( 8 ) 

that at constant volume 

/ds\ l /du\ c, 

T\dT/ v r 

and 

_&s_ ^ 1 0c, 
dTdv T dv’ * 

This is a relation we shall need later. Integrating (9) from 0 to T we 
obtain, by Nemst’s theorem, » 

< n > 

In order that the integral may converge, the specific' heat (for con- 
densed systems it is unnecessary to distinguish between c„ and c„) 
must approach zero sufficiently rapidly as the temperature diminishes. 
Actually, the theory of the specific heats of solids (p. 612) gives a 
decrease as the third power of T. On the other hand, the value given 
by the classical statistics for the specific heat of a gas is different from 
zero even at T — 0. Hence we must leave gases out of consideration 
as long as we take the classical statistics to be correct (see preceding 
section). It follows further from the Nerast Heat Theorem that 
all temperature coefficients of condensed substances vanish at the 
absolute zero. We shall demonstrate this for the isometric pressure 
coefficient. It follows from 

,= ~(^). md p== -0Q, • • • (i2) 

that (dp/dT) v is given by 

(If).- (I o.-x'fw*- •• 



. (13) 
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and from equations (10) and- (13), 



As regards gases, it has already been mentioned that theories have 
been developed in recent years which yield the usual gas equation as 
an approximation at higher temperatures, while in the neighbourhood 
of absolute zero neither the equation of state nor the constancy of 
the specific heat of a monatomic gas is retained. This is called the 
degeneration of a gas. Since all gases condense at temperatures con- 
siderably above the absolute zero, attempts to verify this degeneracy 
experimentally have not yet succeeded. We shall not take up these 
theories (cf. p. 628) at this point, but shall pursue another line of 
inquiry; this will not yield the value of the entropy at absolute zero, 
but will determine the entropy oonstant at finite temperatures. For 
this purpose we consider the equilibrium between a gas and its con- 
densed liquid, for which system the vapour pressure formula (29) 
(p. 538 ) is valid. Here the entropy constant s 2 Q of the liquid is to be 
set equal to zero. Again, we imagine the measurements performed 
at very low temperatures, where the integrals contribute nothing. 
This is because the specific heats of the liquid vanish as T 3 and the 
part of the specific heat of the gas which depends on T also approaches 
zero rapidly, and all that is left for any gas is the specific heat of a 
monatomic gas as the part c„ e which is independent of the temperature. 
The vapour pressure equation is then 

logp = - A + ^logT-f . . (15) 

Thus, by measuring the vapour pressure, the chemical constant 

K (16) 

(and therefore the entropy constant) of the gas may be determined.* 
On the other hand, the theoretical calculation of this quantity is 
possible only with the aid of quantum statistics. 


* The subsoript “ abs ” is appended to i to denote that this is not the u 
which the logarithms of the conversion factors are also included (of. p. 638). 
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4. Unattainability of the Absolute Zero. 

It is a consequence of the Nerst heat theorem that the absolute 
zero is an asymptotic point — it may be approached indefinitely but 
never strictly reached. Since many interesting phenomena are en- 
countered in the small range between, say, 1° absolute and 0, it would 
be better to use a logarithmic scale of temperature in this region, for 
the scale itself would then have an unattainable zero. 

In order to demonstrate the unattainability, we consider the most 
feasible means of cooling — that of adiabatic demagnetization discussed 
earlier. Assume that the entropy-temperature curves have the form 
shown in fig. 2 a, which is contrary to the Nernst theorem. In any case. 



s 


at higher temperatures the entropy curve S„ of the umnagnetized * 
body must lie above the curve S M of the magnetized body. The reason 
is that entropy, as will be explained in the following chapter, is a 
measure of the disorder; and this is certainly greater in the unmag- 
netized state, where the elementary magnets are oriented at random. 
Now assume demagnetization to take place adiabati cally and reversibly 
at a temperature T a . The process will be isentropic, for if SQ is zero in 
the reversible process, then 8S must also be zero. In the diagram, then, 
we proceed along a horizontal line. The point of intersection with S a 
determines the final temperature. If we were to start from a temper- 
ature T a ' or lower, we would attain the absolute zero by the time the 
S p curve is reached. However, according to the Nernst theorem there 
can be no entropy difference between the two forms at the zero point, 

* By “ magnetized ” we shall mean, more accurately, a paramagaetio body in 
a magnetic field. The coincidence of So and Su at T — 0 means that a body at zero 
behaves as though a field were present, i.e. it becomes ferromagnetic. 
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and so the curves must run as shown in fig. 2b, in which case the zero 
point cannot be reached. Starting from a temperature T, the steepness 
of descent of the curve S a determines how close to T — 0 one may 
come. 

No matter what adiabatic process is considered, the same con- 
clusion results. If, for instance, in analogy with the expansion of a 
gas, we seek to cool a solid * by sudden extension, we get similar 
curves for two different pressures, since at any given pressure s must 
be zero for T = 0. 

Despite the fact that the unattainability of the absolute zero can 
thus be deduced from the Nernst law, the two must not be considered 
equivalent formulations. This is the case only if we add the experi- 
mental fact that the specific heats of all substances approach zero at 
least as T 3 (see p. 612). 


* At low tempera* area gases no longer exist as Buck. 
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CHAPTER XXXIV 


The Elementary Kinetic Theory of Matter 


1. Bernoulli’s Formula; Boyle’s Law. 

As already indicated on p. 496 , an ideal gas is defined from the 
atomistic point of view as an aggregate of particles which have no 
extension, and between which no forces operate. These particles are 
in motion amongst each other with all possible velocities. The ap- 
parently steady pressure exerted by the gas is due to the myriad 
impacts of these particles on the walls, from which they rebound 
elastically. Collisions between molecules are not possible for an ideal 
gas, since the particles have no extension. We wish to calculate the 
pressure. Resolve the velocity c of a molecule into its rectangular 
components c x , c y , c t . Consider a portion of a wall normal to the x-axis. 
Each time a molecule collides with this wall it suffers a change in the 
x-component of its momentum amounting to — 2mc x , while the other 
components are not altered by the elastic collision. But according to 
Newton’s Third Law, the impulse communicated to the wall is also 
of- this magnitude. The impacts follow one another in such rapid 
succession that the series of impulses is felt as a constant force. If 
each impact lasts a short time At, the average force F x which the wall 
applies to the-molecule is given by (p. 108): 



According to Newton’s Third Law, the force exerted by the molecule 
on the wall is 

F,= — F? = + (1') 


If, instead of a single molecule, Av of them strike the wall in the time 
At, the force is multiplied by Av. The pressure is given by the force 
on unit area . If the number of molecules striking each square centi- 
metre in the time At is AN, the pressure is 


P = 


- 2 mc m . 


503 


(2) 
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The quotient AiV/A< is the number o£ molecules striking each unit 
area of the wall per second. 

All possible velocities occur, with respect to both direction and 
magnitude. With regard to the former, there is no uniquely indicated 
direction in space; we may therefore consider that, on the average, 
one-third of all the molecules move in the direction of each axis. Of 
each of these groups, half move in the direction of the positive axis, 
half in the negative direction. Further, we assume here that all the 
molecules have the. same speed. In the next second, all those mole- 
cules within a cylinder of base unity and altitude c normal to the wall 
which are moving toward the wall will collide with it, for a molecule 
distant c from the wall and approaching it will just reach it in one 
second, while all those farther away will not. If there are N molecules 
in each cubic centimetre, N/6 of these move toward the wall which 
is perpendicular to the aj-axis. In all, Ncj 6 molecules collide with this 
wall each second, so that Kmi* 



Since all possible velocities occur, c 2 is to be replaced by its average 
value c 2 , the mean square velocity. The square root of this quantity 
— the so-called “ root mean square ” velocity — is only approximately 
equal to the mean Velocity (cf. p. 593). 

Let ua add here a more exact deduction of formula (3) — one that takes into 
account the distribution of the velocities of the molecules. In reality, there is no 
molecule that moves with exactly a given speed in a given direotion at any 
instant. Bather, we must assume that in each cubio centimetre there are d?N 
molecules * whose velocities have magnitudes falling between c and c + dc, and 
directions lying within a solid angle d£l. 

Since all directions are on a par with each other, the number of molecules 
whose directions fall within a solid angle dCl will be proportional to the size of 
this angle : the number of molecules whose speeds fall between c and e -f- dc will, 
however, depend on c through an as yet undetermined distribution function /(c). 


Thus we may write 

d*N =f(c)dcdCi -= / (c) dc sin 6dQd§ (4) 

The total number of molecules in a cubic centimetre, given by integrating over 
the unit sphere and over all speeds, is equal to 

N — 4n J f(c)dc. (4') 


The number of molecules in dCl hitting unit area of the wall in time At is the same 
as the number contained in a oylinder whose base has unit area and whose length 

* The notation d*N is used to indicate that we are dealing with a third-order dif- 
ferential. 

The assumptions made here are not hard to understand in terms of an analogy. 
If one asks in a large assemblage of people, “ Who among those present is exactly 
forty-five years old this minute?” it is almost certain that no one will respond. But 
if one asks instead, “Who is betweem forty-five and forty-six years old?” several 
individuals may make themselves known, and the greater the interval the greater 
the number of expected responses. 
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is e At. Since the axis of this cylinder makes an angle 6 with the normal to the 
wall, the number is d 3 Nc-cosd-At. Each molecule exerts a force 2mc cos 6/AZ on 
collision, so that by integrating over all directions and speeds the pressure is found 
to be 


-co -?i/2 a -co 

=J J j J(c)dc sin 6 . 2mc 2 cos 2 0 d0 d<{> = ~ m j f(c)c*dc. 


If we deal with any multiplying function (p of e, it is logical to define the mean 
value tp as 

$ = +(«>/(«>** (6) 


f f(o)de 


Hence, (5) becomes 


Consider, now, one mol of gas. This contains L molecules, so that 
the number of molecules in one cubic centimetre is N = Ljv. If we 
substitute this value in Bernoulli’s equation, we obtain 


This is the law of Boyle and Mariotte. Identifying the right member 
of (8) with the right member of the gas equation (p. 495), we obtain 
a relation between temperature and mean square molecular velocity: 


_ Lmc 2 
3 ~ 


3 RT 
~ Im ' 


This shows that the mean square velocity of the molecules is directly 
proportional to the absolute temperature and inversely proportional 
to the molecular weight Lm. 

A complete kinetic theory derivation of the entire equation of state 
requires statistical methods, and will be carried out in the following 
chapter. 


2. Number of Collisions and Mean Free Path for Real Gases. 

The assumption that the molecules are particles with no extension 
in space is of course a drastic simplification of the actual situation. 
We obtain a better representation of the facts if we attribute 
finite size to the molecules, considering them to be elastic spheres of 
radius r. The resulting correction of the equation of an ideal gas has 
been considered in detail on p. 497. At this time we wish to consider 
only the collisions between the moving molecules. If all the molecules 
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with the exception of a single one were at rest, it would be a very 
simple matter to calculate the average number of collisions 
experienced by this molecule. This computation will be carried 
out here. There is a collision whenever the distance between the 
centre of the selected molecule and the centre of any other mole- 
cule is less than d = 2 r. The sphere of radius d, into which the 
centre of no other molecule can penetrate, is called the “ sphere 
of influence ” of the molecule. If the average velocity of the 
selected molecule is 6, the sphere of influence sweeps out a cylinder 
of volume nd 2 c in one second. Let there be N molecules per cubic 
centimetre; this cylinder will then contain NttcPc molecules and, 
under the assumptions made, there will be 

Z' — irNdtc . (10) 

collisions per second. 

In reality, the other molecules are also in motion, and so the mean 
velocity of a single molecule must be replaced by the mean relative 
velocity c r of the molecules with respect to each other. This velocity 
is readily computed if, for simplicity, we neglect the distribution of 
velocities and calculate with a single velocity c. But since all possible 
directions between velocity vectors occur, we still obtain a distri- 
bution of relative velocities, the mean value of which we shall calcu- 
late. If 6 is the angle between the velocity vectors of two molecules, 
the relative velocity is 

— — = — = e 

C, = V c 2 + c 2 — 2c 2 cos 6 = 2c sin^ 

The spatial average of the trigonometric function sin (0/2) must be 
determined. In general, the mean of a function of the spherical co- 
ordinates is obtained by multiplying the function by the element of 
area of the unit sphere, integrating over the sphere, and dividing by 
the area of the sphere, i.e. by in: 

m <f>) = ~ f £m *) dd 


In the present case we have 

~~0 2 r . 

S1I1 2 “4» rl 0 J 0 su V' Il 2 ™ a 2 u " u ' v ' 3’ 
Hence the number of collisions per second becomes 


. . 01 ) 
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The more rigorous calculation, taking into account the distribution 
of velocities, leads to the formula 

Z = V2t rNd?6 (11') 


The difference in the numerical factors 4/3 = 1-33 and \/2 = 1-41 
is about 6 per cent. In view of the inaccuracy inherent in the nature 
of the calculation, and in particular the indefiniteness of the concept 
of atomic sphere of influence, the difference is of no practical account. 

The mean free path l is defined as the average distance traversed by 
a molecule between two collisions, i.e. 


1 c _ 3 

Z~ 4 irNd? 


( 12 ) 


or, according to the more rigorous calculation, 

1 “ VtorNd* (12,) 

Let us investigate in more detail the significance of the mean free 
path. Take W(x) to be the probability that a molecule travel a distance 
x without suffering a collision. Then the probability of traversing the 
distance x -f dx is W(x + dx). This is composed of the probability 
that the path x will be covered and the probability that the added 
segment dx will be traversed also without collision. But according 
to the Theory of Probability, the probability that both events will 
occur is given by the product of the separate probabilities, since the 
two are independent. Then 

tJW 

fF(* + tfe) = W(x) + ^dx^W{x)W{dx). . (13) 


Now the probability of free passage through a layer of thickness dx 
is readily found. Imagine a cylinder whose length dx is parallel to 
the direction of motion and whose cross-sectional area is unity. The 
probability of a collision within the distance dx is obviously given by 
the ratio of the combined areas of the front surface, through whioh 
molecules which later suffer collisions pass, to the entire front surface 
of unit area. The former quantity is equal to the sum of the pro- 
jections, on the front surface, of all the spheres of influence lying 
within the cylinder,* i.e. rrNdHx. Hence the probability that the 
segment dx is traversed without collision is given by 

W(dx) = 1 — irNcPdx (13') 

•This is valid only for an infinitely thin layer; otherwise, the projections of the 
spheres of influence will overlap. 
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We then have W(x ) + dx = W{x) — W(x)ttN d 2 dx. 

Integrating, W = Ce~ nNd ‘ x (14) 

The constant C is determined from the fact that the probability is 
unity for a path of zero length. This makes 0 = 1, so that we have 

IF (a;) = er nNd ' x = e~ x!l \ 

The expression rrN d z is, by (10), the reciprocal of the free path 
when all molecules but one are at rest, as we assume is the case 
in the present instance also. If the motions of all the molecules are 
taken into account, the reciprocal of the free path l is 4rrNd 2 ji, and 
the relationship 

W(x) = tr* n (16) 

remains valid. Thus we may look upon the mean free path as that 
path over which the probability of passage without collision is e _1 . 
The distribution function of the free path is of simple exponential 
form, hence the mean free path is not a free path of particularly frequent 
occurrence , but merely the mean value, as may be verified by averaging 
over all free paths, i.e. by evaluation of the expression 

f*xW(x)dx e~ xll xdx 

f W(x)dx f e~ x,l dx 
o o 

3. Viscosity and Heat Conduction in Gases. Determination of Avo- 
gadro’s Number and of the Size of the Molecules. 

(a) Viscosity at moderate pressures 

Inasmuch as the results are very different, depending upon whether 
the mean free path is small or large compared with the dimensions of 
the vessel, we treat the case of moderate pressures and that of low 
pressures separately. It should be remarked that the low-pressure 
region for this purpose is that below a few hundredths of a millimetre 
of mercury. 

Let a gas be in motion parallel to the x-axis in such manner that 
all the molecules in a given layer have a common forward speed u. 
Assume a velocity gradient dujdz along the z-axis normal to the x-axis. 
The hydrodynamics of viscous fluids gives for the force between two 
layers of area S, in contact along the plane z = 0, the expression 
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We wish to compute the coefficient of viscosity ij on the basis of kinetic 
theory concepts. According to Newton’s Second Law, the force which 
the upper layer exerts on the lower is equal to the rate of change of 
momentum of the latter. But the change of momentum can be calcu- 
lated on the molecular theory. As a result of the thermal agitation, 
individual molecules pass from the upper layer — where the flow 
velocity is greater — into the lower layer. These, molecules possess 
forward momentum corresponding to that existing in the layer where 
they experienced their last impact. In the same way, molecules dif- 
fusing from the lower, less rapidly moving layer into the upper stratum 
cause a decrease of forward momentum there. The process may be 
likened to the case of two trains moving in the same direction along 
parallel tracks but with somewhat different speeds. If the passengers 
jump from one train to another, the faster one will be slowed down, 
the slower one will be accelerated. 

We shall now express this idea in mathematical form. Let the 
number of molecules per unit volume having the velocity components 
v x and v t be given by v(v x , v z ), where v is the resultant of the gas- 
kinetic and flow velocities. Then the number of molecules passing 
through unit area of the ®y-plane per second is w t ; these molecules 
transport momentum whose ^-component amounts to v(v a , v t )v t mv x . 
In all, the force on each, square centimetre is 

g = Sv(«„ v x )v.mv u , (17) 

where the summation is to extend over all values of v x and v, which 
occur. The evaluation of this sum is accomplished as follows. First 
we have 

v x = c x + u, v t = c, (18) 

If a molecule has experienced its last collision in the layer z, 

«(* ) = u o + z % (* 9 ) 

where u 0 is the velocity in the stratum z = 0. Using this value, we 
obtain 

£v(» x> v t )mv x v z = %v(v x , v,)m {c x ~f u 0 + z c t . (20) 

The terms Evc x c z and Evw 0 c, cancel out, since there are just as many 
molecules with positive c x and c z as with negative. However, the sum 
iVm v a) zc t does not cancel out, as may be seen by putting z — r cos d. 
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Cj. — c cos 6, in. winch case the factor cos 2 9 appears. If we replace r, 
c and cos 2 6 in 

2p(%,, v,)mv x Vi = 2 p («„,, « e )»wc cos 2 0 ~ . . (21) 


by their mean values l* 
of %v(v x , v M ) = N, 


c and 1/3 respectively, we obtain, on account 


F Nmlc du 

S 3~ dz' 


(22) 


whence, by comparison with (16), 

Nmlc 
^ ~ 3 * 


(23) 


Inserting the value of the mean free path from equation (12) (p. 567), 


— ^ — Mi 
V ~ 


(24) 


where M is the molecular weight of the gas and L is Avogadro’s 
Number. This formula is of importance on account of the fact that N, 
the number of molecules per cubic centimetre, does not appear in it. 
This means that the viscosity is independent of the density and hence 
of the pressure of the gas. Although this result may seem contrary 
to intuitive views, it is well substantiated by experiment. It is only 
at very low pressures, where the mean free path becomes of the order 
of magnitude of the dimensions of the container, that the theory must 
be modified. In this case there is a dependence on density. Quali- 
tatively, it iB easy to see why the viscosity is independent of the density 
under normal conditions: despite the fact that the number of mo- 
mentum carriers increases with the pressure, their mutual hin- 
drance also increases in -the same direction, thus tending to offset any 
change. 

(6) Conduction of heat at moderate •pressures 
Instead of a velocity gradient, assume that there is now a tem- 
perature gradient in the gas in the direction of the z-axis. According 
to p. 489, the flow of heat through the jcy-plane is given by 




dT 

dz‘ 


(25) 


•At first sight, it may seem questionable whether the mean free path l may be 
substituted for the average value of r, since the next collisions may not take place 
precisely in the plane z — 0. In this connexion it must be remembered that there is 
no propagation of probability for the collisions; i.e. regardless of which point of the 
path we reckon from, the probability of traversing the segment r is er Tl1 , so that r *> l, 
Henoe the mean value of all paths described by the molecules after passing the plane 
z — 0 is l, and the same is true, of course, for the original question. 
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where the temperature is denoted by T. We can calculate the co- 
efficient of heat conduction on the basis of the kinetic theory, just as 
we did for viscosity. Instead of a transport of momentum, the passage 
of molecules from one layer to another effects a transport of thermal 
energy. Again, each molecule is assumed to possess the energy of 
thermal agitation corresponding to the layer in which it experienced 
its last collision. The thermal energy per molecule is 

— z'-i^+S*) < 26 > 

where c„ is the specific heat. If there are v(u, c z ) molecules per cubic 
centimetre with the velocity component c, and thermal energy u, 
the flow of heat is given by 

— + = (27) 

Replacing c, r and cos 2 # by their average values, we obtain 


Nice, d£ _ c,c dT % 
3 L dz ~ iirLd? ~dz' 


(28) 


Thus the coefficient of thermal conduction is also independent of the 


pressure. 

It is, of course, not quite correct to replace the quantities c, r and 
cos 2 # by their mean values in the sum, but again the rigorous theory 
yields a numerical factor which differs but little from that' obtained 
here. The resulting factor is 0-247, which replaces 1/4. 


(cT Viscosity and heat conduction at low pressures 
. Consider next the important case of very low pressures, where the 
free path is large compared with the dimensions of the vessel and so 
essentially only collisions with the wall take place.* 

Imagine the gas placed between two parallel walls, one of which 
moves past the other with a speed u. The moving plate exerts a force 
on the one at rest, since the gas molecules that hit the moving plate 
have its forward speed added to any velocity they already possess, and 
communicate this to the other plate at their next impact with it. In 
order to compute the momentum communicated, consider the mole- 
cules whose directions of motion lie in a zone forming the angles # and 
# + dd with the normal to the wall. Sinoe we are concerned only -with 

* m order to gain an idea of the numerical magnitude, notice that at room tem- 
perature the vapour pressure of mercury is 10 -3 mm. and that the mean free path in 
this instance is about 10 cm. 
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those molecules that move toward the wall, we must take half the 
number. The number in each cubic centimetre will then be 

dN — C smddd, 

where the proportionality factor C, evaluated by integration over a 
hemisphere, turns out to be N/2. If the mean speed is c, the number 
of impacts per second per square centimetre becomes 

Z =i.~ f sind cosd dd = 

4 Jq 4 

Since each molecule gets an additional speed u, the momentum transfer 
per second, land hence the viscous force per unit area, amounts to 

F = ^um = ^u, (30) 

where p is the density and is equal to pM/RT. Then, by (9), p. 565, 
we have approximately 

*=!«/$?“• ^ 

Inasmuch as there is no velocity gradient in the space occupied by 
the gas itself, we cannot speak of a coefficient in the sense of equation 
(16); rather, the momentum transfer is independent of the distance 
apart of the walls. 

In the same way, one finds that the heat transfer per unit area at 
very low pressures amounts to 

w =l pJmi T i- T & < 31 ) 

where T is to be set equal to -f- T s ). As long as the mean free path 
is large compared with the dimensions of the vessel, viscosity and 
heat conductivity are proportional to the pressure but independent of 
the dimensions of the container. 

(d) Kinetic Theory determination of Avogad/ro's Number 
and of molecular diameter 

Since we may replace c approximately by Vc® = \Z3RT/M (the 
exact relation, as given on p. 592, is c — V8/3w . Vc*), the only un- 
knowns remaining in equations (24) and (29) are L and d. Since both 
occur only in the combination Ld?, a further relation is needed to 
determine these quantities separately. Such an equation is that for 
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the van der Waals b which, as we already saw on p. 497, is equal to 
four times the space occupied by the spherical molecules: 


, .47 r d 3 T 

‘“‘■I ! 1 


(32) 


Solution of the two equations for L and d yields numbers of the order 
of 10 -8 cm. for the molecular diameters and about 6 X 10 23 for L, 
which was first determined in this way by Loschmidt. However, the 
accuracy of this kinetic theory determination of L is not as great as 
that involving the electronic charge e = F/L, from which is obtained 
the generally accepted value 6-02 X 10 23 . With reference to the gas- 
kinetic diameter, it must be remembered that this is not the actual 
diameter of a small, impenetrable sphere, but rather — according to the 
way it was defined — the radius of the sphere of influence for mole- 
cular collisions. In view of the electrical structure of atoms and mole- 
cules, this sphere of influence is a result of the mutual electrical in- 
fluences of the colliding bodies. This does not mean that other methods 
of determining atomic dimensions would necessarily yield the same 
result. We shall find that the linear dimensions of the space occupied 
by the electrons is of the order of magnitude of 10 -8 cm. All experi- 
ments which depend on the electron shells lead to dimensions of this 
magnitude, byt experiments on the passage of a-partieles through 
matter lead to values of the order of 10 -12 cm. for the diameter of the 
“ centre ”, i.e. the nucleus of the atom. 


4. Derivation of some Properties of Crystals on the Molecular Theory. 

(a) Density and lattice constant 

While the beginnings of a molecular theory of liquids have been 
obtained by starting with a van der Waals gas and assuming the 
cohesive forces to be very large, there 
is no complete theory which explains 
all the main properties, as there is for 
gases. However, for crystals — which 
we may regard as the normal form of 
the solid state — the theory has been 
developed very extensively. On account 
of the elaborate mathematical equipment 
needed, we can give only the simplest 
parts of the theory. M. vcn Lane’s 
disoovery of X-ray interference in crystals 
(p. 385) proved the existence of the lattice 
structure which mineralogists had long 
assumed. To-day a large number of lattice types are known. We shall 
here consider only the simplest case — that of sodium chloride. Measure- 
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ment of the intensity of X-rays reflected from different lattice planes 
shows that the corners of this lattice are occupied not by the neutral 
atoms as is the case for diamond, for example — but by the same Na + and 
Cl“ ions as are encountered in electrolysis. The ions of each kind are 
distributed upoA a cubic lattice, the two lattices being interspaced, 
so that along any line the ions of the two varieties alternate (“ face- 
centred ” lattice). If no distinction is made between the two kinds 
of ion, the crystal represents a simple cubic lattice, in which only the 
corners of the cubes are occupied (cf. fig. 1). The lattice constant a 
is the distance between two homologous ions of the same kind. An 
elementary cell is the smallest unit by whose translation the entire lattice 
can be generated. In the present case it is a cube of edge a. In order 
to calculate the density all that is necessary is to count the number 
of Na and Cl ions in the elementary cell. It must be borne in mind, 
however, that each corner ion belongs to eight cubes, and so is to be 
counted only as 1/8; similarly, a point on an edge is to be counted 
as 1/4, one on a bounding face, 1/2. Thus there are (8/8 + 6/2) 
Na + ions and (12/4 + 1/1) Cl~ ions corresponding to each elementary 
cell. Representing the masses of the individual atoms by M^.jL and 
M C \jL respectively, the density is 

__'4(M Na -f M cl ) M M 

p v- • • • (33) 

4 

Hence the lattice constant is given by 

o = (34) 

This quantity is used to measure the wave-length of X-rays. The 
absolute value is again limited by the accuracy to which the Avogadro 
constant is known. Recently it has been found possible to use ruled 
gratings, of known grating constant, for X-rays, thus measuring the 
wave-length directly. The value of the wave-length may then be used 
to determine the lattice constant; this, together with the density, 
leads to a new determination of L. The value thus found agrees admir- 
ably with that obtained from recent oil-drop determinations of e. 

The method given here for obtaining an equation connecting the 
density and the lattice constant is, of course, perfectly general although 
equation (33) is not. 

(6) Nature of the cohesive forces in a crystal. Lattice energy 
of ‘polar crystals 

■What holds a crystal lattice together? What forces hinder a further 
decrease in volume, so that even very great pressures succeed in pro- 
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during only a slight contraction of a solid body? The first question 
is easy to answer for crystals which are built “tip of ions, the so-called 
polar crystals: the electrostatic attraction between the ions holds 
the structure together. The second question corresponds to that con- 
cerning the nature of the interatomic or intermolecular forces which 
give rise to the spheres of influence. Instead of regarding the ions as 
spheres of a definite radius, it is more natural to introduce forces of 
■ repulsion varying as a high inverse power of the distance. The 



higher the power, the greater the relative effect at small distances, and 
the better the correspondence to the collision of two rigid spheres, 
where the large forces which come into play at the instant of impact 
prevent a closer approach. The Quantum Theory has made clear the 
nature of the interatomic forces: they must arise when the electron 
shells interpenetrate. For the alkali halides the inverse tenth power 
of the separation proves to be the appropriate law of force. In fact, 
the Quantum Theory shows that a single power is really not sufficient 
— the exponent itself depends upon the distance. 

The Quantum Theory has also thrown light on the nature of the 
forces of attraction in non-pqlar crystals, like diamond. These 1 forces 
correspond to the binding forces between uncharged atoms, c.g. the 
H atoms in the hydrogen moleoule, and are explainable on the quan- 
tum basis (cf. p. 712). 

If there were but two ions present, the mutual potential energy 
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of an Na ion and a Cl ion, say, would be given by 



where j3 is a factor not yet determined. It would be incorrect, how- 
ever, to say that the potential energy of the entire lattice is simply 
the above energy u multiplied by the number of atom pairs present, 
for since the electrostatic potential decreases only with 1/r, the 
more remote ions also contribute to the potential at any point. The 
somewhat troublesome computation of the potential due to alternate 
positive and negative charges was made by Madelung, but cannot be 
reproduced here. The result is that the potential energy per ion bond 
is -0-2905 (e 2 /?). We therefore put 

«= -0*2905- +4 (35) 

7* r 

If we make a graph of this potential energy function in terms of the 
distance r, we obtain a curve having a minimum, since the negative 
term preponderates for r large, the positive term for r small (fig. 2). 
According to the laws of mechanics, this minimum . corresponds to the 
equilibrium distance r 0 . But the minimum condition dujdr — 0 can 
be interpreted also to mean that the force — which is obtained by 
taking the potential gradient — vanishes at the equilibrium distance; 
i.e. the forces of attraction and repulsion are of equal magnitude. 


This serves to determine j8, for 

- 0-2905 ^ + ~io = °, 

•b* n > 10 

whence 0 = i (0-2905) e\\ (36) 

and thus, in general, 

« = - 0.29056 2 (l - ~ ^ (37) 


We obtain the total potential energy of the lattice in the position of 
equilibrium (r = r 0 ) in the following way. In the Nad lattice, six 
bonds connect each ion with the neighbouring ions of opposite sign. 
Since we count each bond but once, we may consider only the ions 
of one sign, say the Na + ions. The lattice contains L Na + ions per 
mol, so that the energy of each pair must be multiplied by 6L. We 
then obtain the lattice energy per mol as 


u, = - l-743Le2 

yr 0 


(38) 
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Using the fact that r 0 = 
we have 


■ a/2 and employing equation (34) (p. 574), 


u { = — 547 X 10 3 


-^Nm + M a 


tt, = — 547 x 10 s 




in general, 


where the energy is expressed in calories.* This' latt __ 

for all salts of the NaCl type, is negative, since energy is liberated 
when the oppositely charged ions approach each other. This is then 
the work which must be done to separate the parts of the lattice and 
scatter them infinitely far apart. It is not equal to the heat of vapori- 
zation, for in that case the lattice is converted into molecules. 

(c) The compressibility of polar crystals 
The potential curve shows why a crystal resists changes in volume. 
As soon as the distance between two ions is increased from its equili- 
brium value, a large increase in potential energy occurs, so that a large 
amount of work must be expended. The force per ion pair is 

• ( 40 ) 


This force vanishes in the equilibrium position. We develop F in « 
Taylor’s Series at the point r 0 and obtain 

F = 0 + ~(r-r 0 ) + ...= 0-2905e 2 (~ - (r - r 0 ) . . . 


2-324e 2 


(r — fo) . . . . 


(41) 


The work done by the internal forces when one mol of crystal is so 
compressed that all interionic distances are changed from r 0 to r is 


V = 6L f F dr = -6 X 2-324 ^ f (r — r 0 )dr 
== _ 13 . 94 ^^ 8 . . . 


(42) 


The negative value shows that work must be supplied from without. 
But the work done by the internal forces is given by the integral 


f* pdv. 


Ib usually expressed in cal/mol, sinoe it represents a ohemioal 


(s 711) 
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Since the pressure is zero at the start and P at the end, we obtain 
an approximation to the integral by multiplying the total change in 
volume by the mean pressure P/2. But according to equation (33) 
(p. 574) the change in volume is given by 

At> = A = A (2Zr 0 3) = 6 Lr* (r - r 0 ). . (43) 

Equating the two expressions for the work, 


13-91 Le . 


f o) J = 62> 0 2 (r — r 0 ) |, 


(44) 


from which 

„ 13-94 e^r - r 0 ) 13-94^ 

6r 0 6 ~ 36Zr 0 5 . r 0 2 


13-94^ 1 At’ 

lSfo 4 !* « V 


and the. compressibility k becomes 

1C. 4 

k = = 5-60 X 10-V = 3-50 X 10 17 o*. . (45) 

This relation is very well substantiated experimentally, but it should 
be noted that the inverse tenth power, which has no other basis at 
present, is fitted to the compressibility. 

(d) Qualitative explanation of the thermal expansion of crystals 
The thermal motions of the particles constituting a crystal can 
consist only of vibrations of these partioles about the equilibrium 
positions to which they are bound. When an atom is in the position 
of equilibrium it is subject to no force, but as soon as it leaves this 
position, restoring forces arise which, by equation (41), are propor- 
tional to the displacement r — r 0 , and so give rise to harmonic vibra- 
tions. This proportionality between force and elongation, however, 
represents only a very rough first approximation. If we consider the 
potential curve (fig. 2, p. 575), we see that for larger displacements, 
where the higher terms of the power series must be taken into account, 
the forces become unsymmetrio, since the curve is much steeper on 
the approaching side than on the receding side. Hence if an atom has 
a given amount of kinetic energy when it passes through the position 
of equilibrium, this will become equal to the potential energy at a 
smaller distance from the rest position on the near side than on the 
far side. This means that the reversal of motion will take place sooner 
on the near side. On the average, then, the distance between two 
atoms will be increased, and this is the reason for the thermal expan- 
sion of crystals (cf. Ex. 40, p. 103). 



CHAPTER XXXV 

The Classical Statistics of Boltzmann 
1. Entropy and Probability* 

We wish now to treat molecular phenomena somewhat more ab- 
stractly by starting from a perfectly general relation which bridges 
the gap between thermodynamics and statistics, and deducing from 
it a large number of consequences. 

Prom the example of the diffusion of two gases through each other 
— a process involving an increase of entropy — we recognize that there 
must be an intimate connexion between the entropy of a system in 
a given state and the probability of finding it in this state. If an urn 
containing equal numbers of white and red balls is thoroughly shaken, 
the probability of finding a practically uniform mixture is enormously 
greater than that of finding only white balls in the upper half and 
only red balls in the lower. Thus the probability has increased to a 
large extent in the process of mixing; this is equally true for the 
diffusion of two gases, for which this model is intended. Is it possible 
to formulate this relationship mathematically? The entropy of two 
completely independent systems — e.g. 1 cm. 3 of oxygen at standard con- 
ditions in London and 1 m. s of hydrogen at a pressure of 10 atm. 
and a temperature of 300° abs in Manchester — is, according to the 
definition of entropy as an additive quantity, equal to the sum of the 
separate entropies, if we consider the two systems as a whole. The 
probability of finding both systems at the respective given conditions 
is, by the law of probability for independent events, equal bo the pro- 
duct of the separate probabilities. We thus have 


+ V (1) 

W n = W t W (2) 

What function satisfies the condition that 

( 3 ) 

Obviously the function k log a; is appropriate. We therefore take 
S — k log W ( 4 ) 

679 



580 HEAT. STATISTICAL PART [Chap. 

This is the celebrated Boltzmann Relation — one of the most impor- 
tant equations in physics. We shall find later that the factor of pro- 
portionality k is the “ gas constant per molecule ” R/L. 

How is the probability of occurrence of a thermodynamical state 
to be expressed? The probability of an event is usually defined in the 
Theory of Probability as the ratio of the number of cases favourable 
to the occurrence of the event to the total number of possible cases. 
This probability has a maximum value of unity, corresponding tor 
certainty. In thermodynamics, however, it has been found more con- 
venient to take simply the number of favourable cases as the measure 
of thermodynamic probability. Thus the thermodynamic probability 
is a large number. 

In counting the number of cases favourable to a given thermo- 
dynamical state, the expression N\ (“ N factorial ” = N(N— 1) 

(N— 2) 1) occurs, as in other branches of combinatorial analysis. 

Since we deal here with numbers of the order of magnitude of L, i.e. 
10 23 , the evaluation of Nl would be out of the question. 

Fortunately, however, there is an approximate formula due to 
Stirling which gives a good approximation when N is large. If we 
write 

logiV! =,logl -f log 2 -j- logo + log 4 -f- . . . logiV, 

we may look upon the series as a sum of rectangular strips whose 
breadth is A®= 1 and whose height is log®. When x is large the 
intervals of unit width may be regarded as differentials, so that the 
sum may be replaced by the integral 

J log xdx — [® log® — ®]j* = N log N — N + 1 . 

Neglecting 1 as compared with N and taking the antilogarithm of 
both members, we have the useful approximate formula for large 
values of N:* 

m ~ (f y (5) 

2. Calculation of the most Probable Distribution of Density in an Ideal 
Gas. 

We shall calculate the thermodynamic probability and its maximum 
value, corresponding to the state of equilibrium, for the density dis- 
tribution in an ideal gas. Here we already know the result: at equili- 
brium the density must, of course, be the same at all points. Although 
the case is trivial, it is — on account of its simplicity— suitable as 
an explanatory example of the method of computing probabilities. 

* Stirling’* Theorem states more exactly that log (N\) (N + J) log N — N + 

J log (2ir), the error being approximately 1/(1 2N). 
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In the thermodynamic, macroscopic sense we characterize a given 
state by giving the density as a function of position. But for an 
observer capable of seeing and distinguishing between individual 
molecules, a given state would be defined only by stating the co- 
ordinates of each molecule. It is clear that any particular “macro- 
state ” can be realized by a large number of different “ micro-states ”, 
for the interchange of two molecules does not alter the density dis- 
tribution in the least. We therefore look upon the number of micro- 
distributions corresponding to a given macro-state as a measure of the 
probability of occurrence of the thermodynamic state. In order to com- 
pute this number, we divide the volume into a number of equal cells 
of volume At. The density is then given by the ratio of the number 
of molecules N t in each cell to the volume of the cell. The cells are to 
be small, but nevertheless large enough to include a great number 
of molecules, so that the density NJ At remains a continuous function 
of position. If we were to make each cell of the order of magnitude 
of the average space available to a molecule, we would notice relatively 
large discontinuities in density from cell to cell, according as 0, 1, 
2, 3 . . . molecules happen to be in each. 

Let the cells and the molecules be numbered, and suppose that 
the molecules are thrown into the subdivided volume much like a 
handful of glass beads. Since the cells are all of equal size, it is equally 
probable whether a molecule X falls into the cell y or the cell z. That 
is, the division into equal cells has assured equal a priori probability 
of the cells, as it is usually expressed. This equal a priori probability 
is not disturbed if there are already several molecules in a cell; it is 
assumed that there is sufficient room for others. 1 " Thus, in order to 
compute the thermodynamic probability, we need only count the 
number of ways in which N molecules can be distributed among Z 
cells in such way that there are N { molecules in the ith cell. How 
often this state can be produced is best seen from an example. Let 
the following distribution be given: 

Cell: 1 2 3 4 5 6 

Number of molecules: 1 0 2 0 3 4. 

Such a distribution of ten molecules among six cells may be realized, 
for example, in this way*. 

Molecule number: 123456789 10 

In cell number: 616565335 6. 

Evidently there are as many micro-states as there are permutations 
of the cell numbers of the molecules. It is to be noticed that the change 
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in position of the molecules within one cell does not yield a new micro- 
state (or “ complexion ” as it was termed by Gibbs and Boltzmann), 
for the micro-state is determined solely by giving the cell numbers. 
Thus the probability of the distribution is equal to the number of 
permutations of the lower row of the preceding table. If all the cell 
numbers were different this would be N\, but since, for example, the 
number six occurs four times, all permutations of these four numbers 
among each other are omitted. On account of the fourfold occurrence 
of the number six we then have only the 1/4! part. In general, 

W- - m - m (6) 

n {Nf\) w 


Applying the Stirling approximation and remembering that E2V, = N 
we have 


W — 


N ' 

UN?*’ 


(&) 


We now introduce /(t), a density function of the cell index, defined by 
2 ^/(*)At<=/(*)At (7) 

As long as all the cells have the same volume, we can omit the sub- 
script on At. The entropy then becomes 
S = k{N log N - S N t log NA = k(N log N - S/(i) At log/(t) 

— 2/ (*’) At log At). . . (8) 


The .equilibrium value is characterized by the vanishing of the first 
variation: 

_ SS = 0 = S 8/(»)At log/(t) + E 8/(i)Ar + S 8/(i) At log At. (9) 

It is to be remembered that an auxiliary condition is given by the 
fact that the total number of molecules is to remain constant: 

E/(»)At == N, i.e. E 8f(i) A t — 0. . . . (10) 

The variations of the distribution function are therefore not arbitrary; 
equation (10) must always be satisfied. If we apply the method of 
undetermined coefficients (p. 115), multiplying equation (10) by a and 
adding it to equation (9), the coefficients of the variations may be 
set equal to zero separately, and there results 

log /(») + a + 1 + log At = 0 or /( t) = const. (11) 

As expected, the most probable distribution is that of uniform density 
throughout. From equation (10), the value of the constant turns out 
to be N/V, where V is .the total volume. 

The division into regions of equal a priori probability is too narrow 
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a limitation. In many instances, e.g. in the distribution oyer the sur- 
face of a sphere, a division into zones of equal size is quite impossible; 
moreover, in quantum statistics it often happens that the individual 
cells must be given different a priori probabilities (weights), which is 
naturally equivalent to making the sizes of the cells different.* How 
are we to proceed in this more general case in which each cell has 
a volume At, of a priori probability < 7 ,? In any case, we set the total 
weight (?, equal to the cell volume multiplied by g { . If the distribution 
is such that one of the cells has twice the volume of any of the others 
(which have equal volumes), the probability that a molecule o should 
fall in this cell will be doubled, as will that for molecules b and c. Then 
the probability that all three molecules fall in this cell is the product 
of the individual values, and so increases to 2 s . In general, the factor 
to be applied to a combination in which N { molecules fall in a cell 
having a weight G f is G t m . Thus a combination in which there are N t 
molecules in the first cell, N 2 in the second, &c., has a factor G 2 N 'G 2 Nt . . . , 
and we obtain 

w Nina** N N m { N{ N N nG^ AT * 

~ ilv,! - njjpr - n[/(i)Arj/<*>^' * ( ) 

S = Jc[N log IV" + S/(t) At, log G t 

- S/(»)At, log/(i) - S/(»)At, log At,]. . (13) 


The calculation of the most probable distribution is carried out exactly 
as in the previous example. We have immediately 

y = 0 = SS/(i) At, log (?, - 2 8 /( 1 ) At, log/(i) - S 8 /(i) At, 

— 28/(i)At, logAT„ . . . (14) 

with the auxiliary condition 

8N = S 8/(i) At, = 0. (15) 

Multiplying by —a and adding equation (14), we obtain 

log (7, — log/(i) — 1 — log At, — a — 0, . . (16) 

or f(i) — const. (17) 


Since S/(i*)At, = N, the constant is N [LG ,, so that the most probable 
distribution is represented by the function 

'' NG < _ _ N 9i_ 


f(i) = 


At,S G { Sf/, At," 


(18) 


* In many install cos the a priori probabilities have been obtained in such manner 

that the connexion with experience was realized after their introduction, so that here 
we are really dealing with weights deduced from experience — a posteriori probabilities 
— whose firmer theoretical foundation (and hence transformation into true a priori 
probabilities) must be left to further developments. 
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With equal a priori probabilities g t — 1, the result is 

= EAr, = V ( 19 ) 

Thus even if the cells differ in size, as long as they otherwise have no 
special a priori probabilities, the most probable distribution is that of 
uniform density. 

Ex. 128. It follows from the formulae of p. 530, that, if a paramagnetio sub- 
stance could exist in an unmagnetized state at absolute zero (this is denied by 
the Nemst theorem), its entropy per mol would in the simplest case be greater 
than in the magnetized condition by B log 2. Deduce this result statistically, 
remembering that in the simplest ease only the orientations parallel and anti- 
parallel to the field are permitted. 

3. Representation of the Distribution of Position and Velocity by means 
of Phase Space. Liouvilie’s Theorem. 

We wish to know not only the position, but also the distribution 
of the velocities of the gas molecules in the state of equilibrium. In 
order to compute the probability of a given state, it is natural to seek 
to apply the procedure of the last section to this more complicated 
case. To do this, we need only replace actual three-dimensional space 
by a hypothetical six-dimensional space in which the co-ordinates of 
a point are the three space co-ordinates of a molecule together with 
its three velocity components.* Thus every molecule in actual three- 
dimensional space has a representative point in this six-dimensional 
space. One might raise an objection to this procedure. In ordinary 
space it is immediately evident that the a priori probability of a cell 
is proportional to its size, but how can we be certain about the a priori 
probability of a cell in the hyperspace ? If, instead of taking the velocity 
components, we choose the general momentum components p k (cf. 
p. 121 ), the space built up of the position co-ordinates q k and the 
momentum co-ordinates p k is called a phase space. For this space it 
can be shown that the cells of equal a priori probability are of equal 
size or — what is the same thing — that the a priori probability of a 
cell of phase space is proportional to the size of this cell. For if we 
consider a volume element At of this space, representative points will " 
enter and leave through the bounding surface of the element, and we 
can compute the “ source-strength ” (divergence) for the element 
exactly as in the three-dimensional case. If '' 

r = q x e x + q 2 e 2 + q 3 e 3 + ?1 e 4 + p 2 e 5 + p 3 e e , . (20) 

* For the present, we limit oar considerations to monatomic moleoules whose 
condition is uniquely determined by the position and velooity of the centre of gravity. 
For polyatomio moleoules we have angular co-ordinates and velocities also. 
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the velocity C of a representative point is 

C = C 1 e l -f- C 2 e 2 + C z e 2 + C t e 4 -j- C s e s -f- C e e a 




dt 6s+ dt & 


585 

( 21 ) 


Choosing a small six-dimensional cube as the element of volume, we 
have, exactly as in the three-dimensional case, 


,. n_ 9Cf i . dC * , 3C a 0C 4 , dC s dC e 

8 ? i + dq a + dq a + dp 1 + dp 2 + d Ps 




Now the molecules themselves obey the fundamental laws of mechanics, 
which we may write in the Hamiltonian form: 


dp k dH dq k dH 

dt dq k ’ dt dpt 


(23) 


whence it follows that 


d dp k 
dp k dt 


+ 


d dq* 
Fq k dt 


= 0 . 


(24) 


On account of this last- equation, every term in (22) cancels out the 
other term having the same subscript, and so the entire right-hand 
member vanishes. Now the vanishing of the divergence of the velocity 
is characteristic of the flow of an incompressible fluid; if a given 
number of particles (in our case, representative points) occupy an 
element of volume A r f at a certain time, the^ will occupy another 
volume of equal size at a later time. Thus the relationship between 
two equal volume elements in the phase space is such that if one is 
occupied by N t points the other is also filled with N { points. Since, 
with the passing of time, all elements of the phase space are traversed 
by the flow, all elements of equal size have the same a priori probability. 
This is Liouville’s Theorem. We have thus found a basis for com- 
puting the probability of a distribution in position and velocity. These 
consideration are not limited to a phase space of six dimensions, but 
hold for any hyperspace. 

A diatomic molecule may be visualized by means of a model having 
the form of a dumb-bell. The radius of influence deduced in the elemen- 
tary kinetic theory for spherical molecules is merely an average for 
various orientations of the collidin g molecules. In the case of diatomic 
molecules we are concerned also with the orientation of the axis and 
the velocity of rotation about an axis normal to the figure axis, sp that 
the appropriate phase space has a greater number of dimensions. But 
since Hamilton’s Equations hold for any two canonically conjugate 
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variables, regardless of how many degrees of freedom the system 
has, the Liouville Theorem concerning the equal probability of equal 
elements is always valid as long as the particles whose representative 
points move in the phase space follow the laws of mechanics. 

4. The Maxwell-Boltzmann Energy Distribution. 

We are now in a position to apply the method of § 2 (p. 580) literally 
to the phase space. By so doing, we can answer the question of how 
the total energy V of a system is distributed among the individual 
particles. The total energy of an ideal, monatomic gas, for example, 
is made up of the kinetic energy of translation of all the atoms. How 
is this energy distributed among the various atoms? Do all atoms 
move with the same speed, as we assumed for simplicity in the pre- 
ceding chapter? A definite energy u(i) corresponds to each cell of the 
phase space; this is the energy possessed by a molecule whose repre- 
sentative point falls within this cell. For generality, assume that the 
cells are of various sizes. Then, by Liouville’s theorem, each will have 
an a priori probability proportional to its size. If, as in § 2 (p. 582), 
we again define a distribution function /(i) by N t — f(i)Ar { , the prob- 
ability becomes N N U Ar/^r, 

ir ~Tm ( ~ 5) 

and the entropy is 

S — kN logN — hh f(i)&.T< log/(i). . . . (26) 

The maximum value of S is to be determined under the auxiliary 
conditions that both the number of particles and the total energy V 
are to remain constant. The former condition may be stated 

N=2f(i) At* i.e. 8N = SS/(»)Ar,= 0. . (27) 

The energy of the particles whose representative points lie in the ith 
cell of energy u(i) is obviously Niii(i), so that the total energy is 
2 m (*)/(») A r<, whence 8U — 2S/(i)«(»)At,- = 0. (28) 

The variation of the entropy again leads to the expression 


= 8f(i) An log/(») + S 8/(t) A t,. . (29) 

If we again apply the method of undetermined multipliers by multi- 
plying the first auxiliary condition by a, the second by 8, and adding 
the resulting equations to (29), we obtain 

log/(t) + 1 + a + =0 (30) 

or setting e -<1+ ‘ > *= A, 

f(i)=Ae-* a(i) (31) 
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The constant * A, which is connected with the multiplier a, is again 
determined from the total number of particles, for 


so that 


N = 42*"*® At* 

Ne~*" 
Ze 


/(*') = 


(32) 

(33) 


The sum appearing in the denominator is called the partition sum of 
the system. We shall set this extremely important quantity equal 
to o: 

a — Ze~ 0u(i) Ar t . ...... (34) 

To determine the multiplier j8 we must return to the entropy. Putting 
in 

/(i) = ^e" M0 (33') 

the entropy becomes 

S= kN log N ~kZ^ e~ mi> Ar t (log N - jS u (i) - log a). (35) 


On account of (34), and since 

^ Ze _3 “ <0 m (i) At, = U (36) 

our expression simplifies to 

S= kpU + kNloga (37) 


If S is given as a function of V and F we obtain T by differentiation, 
according to p. 526: 



Further, equation (36) shows that U and j8 are functionally related, 
so that 



* In oomparing the present application of the method of multipliers with that on 
p. 115, where the quantities denoted there by A were functions of x, one may question 
the constancy of a and J3. In the earlier example, the auxiliary conditions were given 
as functions of x, and are to be satisfied for any value of x. Here, on the other hand, 
two simple equations are to be satisfied — equations which no longer oontain the cell 
subscript, since it was removed by the summation process. 
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Carrying out the differentiation, we have by equation (37): 

dS JTT f 10 dV . kNda 

.... ( 40 ) 

and from (34), = — Hu(i)e- pu ® Ar t — — . . . (41) 

Hence, by (39) there remains 

or ? = ■%, (42) 


The distribution function then becomes 

N e -mikT 


/(*> = 


(43) 


This is the celebrated Maxwell-Boltzmann Law of Energy Distri- 
bution. In the next section we shall learn several of its applications. 

Having determined j8, we obtain a simple thermodynamical inter- 
pretation of the quantity o; putting the value 1 /kT for ft in equation 
(37), there results 

5 = ^ + kN logo. 

However, the dimensions are not yet correct in this formula. As is 
seen from equation (34), a has the dimensions of a volume of the phase 
space. On the other hand, the argument of a logarithm must be 
dimensionless. Now, in the classical thermodynamics S is determined 
only to an additive constant, so that we are at liberty to add con- 
stants to the right member in order to make its dimension zero. Such 
a constant is — kN log At, where At is, say, an arbitrary measure of 
the volume of the phase space element. Since the product of a position 
co-ordinate by a momentum co-ordinate has the dimensions of energy 
X time, which we call action, [A], At has the dimension [A]* if 
there are s position co-ordinates. If we combine this added term 
with the term kN log a, we see that the unit of phase space volume 
must be taken to be [A]*, where the magnitude of [A]* is still unde- 
termined. But the Quantum Theory will show that [A] has a perfectly 
definite numerical value. We must therefore write 


S = ^ + kN lo g[ |p (44) 

Then the free energy becomes 

F = U-TS = -kNTlog~. . 


. • ( 45 ) 
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5. Applications of the Maxwell-Boltzmann Energy Distribution. 


(a) Halley’s Law and the equilibrium of sedimentation 


Assume that the energy of a particle of mass m depends only upon 
one position co-ordinate. For example, let the particle be in the.earth’s 
gravitational field and have a potential energy mgz. Then, if we take 
as cells the parallelepipeds of base unity and altitude Az, the number 
of particles between the levels z and z -j- Az is given by (43): 


A z, 


or the density is given by 

AN N -■ 
P = -aT = ~ e 


(46) 

(46') 


Instead of the function a 
the level z — 0, obtaining 


we introduce here simply the density p 0 at 


(47) 


Thie formula includes the Halley formula (p. 192) for the varia- 
tion of density (or pressure) with altitude in an isothermal 
atmosphere, for if we replace p 0 in the exponent of equation (4) 
(p. 192) by the value M/v 0 — where M is the molecular weight—; 
and if we use the relation p 0 v 0 — RT, the exponent becomes 
identical with that of equation (47), since it will be shown later that 
MIR = m/k* 

This formula is valid also for the distribution of visible colloid 
particles. If, for example, we are dealing with a mastic emulsion in 
water, the mass m is to be replaced by the actual mass of each particle 
minus the hydrostatic buoyant force. This means that in an emulsion 
of this kind the number of particles must decrease exponentially with 
the height, and this is actually observed. Thus, if the masses of indi- 
vidual particles can be found in any way, the counting of the numbers 
of particles at various levels will serve to determine the Boltzmann 
constant h, and hence the Avogadro constant® L by means, of the re- 
lation k — R/L which will be derived below. 


(6) Angular distribution of the axes of elementary magnets in a 
magnetic field 

It is now possible to justify the formula used on p. 465 for the 
distribution of the angle 6 which the axis of a dipole makes with the 

* Inasmuch as the temperature is assumed to be the same at all points, the dis- 
tribution of veloc’ties can have no effeot on the distribution of particles in altitude, 
so that the statistical calculations for the z-co-ordinate may be performed separately. 
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field. The potential energy of a small magnet of moment p in a field 
H is given by 

u=—pHco&6. ...... (48) 


The phase space is, in this case, the surface of a unit sphere upon 
which we can mark the several axial positions by points. The division 
into cells is best made by taking zones between the angles d and 
d + Ad. These zones, however, are not of equal size, since their areas, 
for equal intervals A d, are proportional to sin d Ad. We then have for 
the number of magnets whose axes make an angle with the field which 
lies between d and 6 + Ad: 


Since there is no restriction on the size of the intervals in classical 
statistics, ‘we may make them so small that the sum may be replaced 
by an integral. Hence in the denominator we have 

a = 2n J* e pHcoae,hT s in 6 dd. .... (50) 
The rest of the computation has already been carried out on p. 456. 

(c) Distribution of velocity in an ideal gas 

We now investigate the distribution of the velocities of the mole- 
cules of an ideal gas, limiting our considerations to the translatory 
motion of the molecular centres of gravity. Any rotational motions 
of polyatomic molecules about their centres of gravity will not be 
considered. These rotations can have no effect on the distribution of 
translational velocities, so that the results hold not only for monatomic 
gases, but for any molecular structures. The momenta are, in rect- 
angular co-ordinates, 

p m — mi, p y — my, p, — mz, i.e. p = mV& + y 2 + i 2 = me, (51) 
and we have 

— . . (52) 


The number of molecules whose centres of gravity lie between x and 
* + Ax, y and y -f- Ay, z and z -f- Az, and whose momenta are 
between p x and p x -f- Ap x , p v and p v -f- Ap v , p, and p, + Ap, is thus 
given by 

AN = ^ e~ ( **' +p * ,+V > /2, "* r Ax Ay Az A p x A p y A Pt . (53) 
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Dividing both sides by the element of volume, we obtain the number 
An of molecules per cubic centimetre whose momenta lie between the 
given limits. If we again replace the summation by an integration 
the denominator becomes 

a = ///// / e-GV+V+V)/*"**- dxdydz dp a dp„dp, 

= V / + “ jy e-W+PS+>'W r dp a dp v d P ' = VP. (84) 

The integral I, which occurs here three times — with different variables, 
but of the same form — cannot be evaluated as an indefinite integral 
in terms of elementary functions, but the definite integral between 
the limits — oo and +00 can be computed with the aid of a simple 
artifice. Set 

d Pa =V*mkTd£. . . . (55) 

The integral is then 

I = V2mkT .... (56) 

If the same integral is now written with the variable of integration ij 
and multiplied by the integral in f , we obtain 

P= 2mkT jy e-?d£ 

= 2mkT jy . . (57) 

Now the double integral is readily evaluated in polar co-ordinates: 
If we allow the polar angle to go from 0 to 2 n and the radius vector 
from 0 to oo , the plane will be completely covered, just as if the rect- 
angular co-ordinates f and tf were allowed to vary between the ’units 
—00 and + 00 . Hence 

P = 4w mkT jf e - '** pdp = —2TrmkT[e~ l,, ]g — 2mnkT. (58) 
Then a = V (2rnnkT) m , . (59) 

We are less interested in the distribution of the components of the 
momenta (or of the velocities) than in that of the magnitudes. If the 
momenta are represented as radius vectors, we inquire how many end 
points fall in the region between r and r -f Ar. Obviously, this number 
is obtained by adding all the points in the individual elements 
AxAyAz included within this shell. This summation is very ample 
since, according to (53), the density of distribution depends only upon 
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p* -4* -f- V* = P 2 ! and so is constant over the surface of a sphere. 

We thus have the sum given by the distribution function multiplied 
by the total number of elements contained in the spherical shell of 
this momentum space, i.e. simply the distribution function into the 
volume of the spherical shell: 


A Ne~ . 4-njPAp 
An FMfcfp • 


. (60) 


If we express p in terms of c and also set NjV = n we obtain the final 
formula 


An 

n 


4e a e-'« , '*« , Ac 


=f(c)Ac. 


• (61) 


The distribution function /(c) is plotted in fig. 1 for the case of 
nitrogen. Since this function vanishes at c = 0 and at c = «o , it must 



have an extremum at some point. In order to locate this extremum 
—which, in this case, is a maximum — we introduce the notation 




(62) 


for convenience. Setting the first derivative equal to zero gives 
dc a?V" 


a*y/-n e 


from which 


V m 


(63) 


This may be termed the most probable molecular velocity. More- 
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over, we also wish to compute the mean velocity. This is given by 

c= -2cA»= — 7- f f a?e~*‘dx 

n a* y/tt Jo V™ 

= f" x*e~*' d(x*) = - ~ 

Y 7r Jo y 7T L J q y t t J$ 

••••••••• (“> 


Finally, the mean square velocity is 

--= - Sc 2 An = r <*er#* dc^~[* 3*<r*'d{**} 

n a 3 V'/r J 0 v^-'o 

- - .+ « - - £ m: 

+^ v -^c 




3a 2 f 

3a 2 -^/tt 3a* 

= 2 v"n “ T 


= |c„ 2 ; Vc 2 - l-225c m (65) 

We have thus derived the relation between average velocity an d roo t 
mean square velocity which was given without proof in Chap. XXXTV. 


(d) The equation of state of an ideal gas; BoUzmann Constant and 
Gas Constant 

If the value obtained above for the function a oi an ideal gas is 
inserted in the equation for the free energy (pr588), one obtains 

F= - kNT lo g[ -~ = - JcNT log{F{27m&T) 8/2 }+ IcNT log [A]*. (66) 


Using the second equation of (34) (p. 527), viz. 


rob 

1 

II 

e. 

. . . (67) 

, , . kN T 

we obtam p = ~y~> . . . . 

... (68) 

or for one mol ( N — L), 


pv — kLT — RT. . . , 

. . . (69) 


Therefore, as already mentioned, lc is the gas constant per molecule: 
*-£- — X Q.Q2xlO^ [ ~ “ 1380 X 1(H8 e ^ de g' ™ 
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(e) Osmotic pressure 

On p. 547 this phenomenon was treated from a thermodynamic 
point of view. It is possible to attain the same result by a statistical 
approach. 

Unlike the pressure exerted by a gas, evident as the result of mole- 
cular impacts with the walls, osmotic pressure is less readily visualized. 
We can think of the membrane as a field of force affecting only the 
molecules of the solute — an assumption not in the least artificial since, 
in the last analysis, the effect of the walls of a vessel of gas is deter- 
mined by the field of force of the molecules constituting the walls. 
This assumption is particularly apt in the case of a membrane, for there 
must be a certain thickness of membrane material within which the 
selective permeability comes about. A particle approaching the wall 
will then obtain a potential energy u which rises sharply toward in- 
finity as penetration occurs. The force exerted on the particle by the 
wall is given by 

P = — gradw (71) 

Inasmuch as u increases as the particle approaches the wall, F must 
be directed into the solution, and according to Newton’s Third Law 
the particle exerts an equal and opposite force on the wall. Quali- 
tatively at least, the situation is the same as for the phenomenon of 
gas pressure. It remains to show that the gas laws actually hold for 
osmotic pressure. 

For the present, assume that equilibrium has not yet been reached 
on the solution side. In this condition the full pressure excess does not 
yet exist on the solution side, and the force field of the membrane 
repels the particles. Now, it is known that when a molecule or an ion 
moves in a liquid, viscosity causes some of the solvent to be dragged 
along (see p. 419). The liquid thus streaming away from the membrane 
is replaced by fresh solvent which passes through the membrane, for 
no barrier exists in so far as the solvent is concerned. The solvent 
streams in until the pressure excess equals the osmotic pressure. At 
equilibrium the Boltzmann distribution law holds, which means that 
the number of particles of energy u in each cubic centimetre is given by 

n = (72) 

Here « 0 is the number of particles per cubic centimetre at a point in 
the solution far from the membrane. The force on an element of volume 
of the solution containing n particles of energy u per cubic centimetre 


fdv — — n grad u dv. 


(73) 
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It follows from (72) that 

grad n = e~ ulkT « = — — , grad «. . (72') 


Substituting into (73), the resulting expression for the force per unit 
volume becomes 

f— kT grad n (74) 


According to the fundamental hydrostatic equation (p. 191 ), we have, 
at equilibrium. 


Integrating, 


gradp =f= kT grad ». (76) 

V = kTn + p 0 , (76) 


where p 0 is the pressure corresponding to » = 0, i.e. the pressure on 
the solvent side. The osmotic pressure is p — p 0 . If one mol of eolute 
is dissolved in a volume v of solution, then n — Ljv and 


RT 

v 


(77) 


Thus we have deduced the van’t Hoff law (p. 647) on a statistical basis. 


6. The Law of Equipartition and its Application to the Specific Heats. 


The energy of a single particle is, in most cases occurring in prac- 
tice, a homegeneous function of the squares of a number of position 
or momentum co-ordinates to which we give the unified designation 
Xi, x 2 ... x f . The total number / of parameters upon which the energy 
depends is called the number of degrees of freedom. This is unlike 
the usage in mechanics, where the number of position co-ordinates 
alone was called the number of degrees of freedom of the system. 
Under the condition that we have a homogeneous quadratic energy 
function, Euler’s theorem (p. 631) is valid; 


du . du , du „ 


(78) 


Passing directly to integration instead of summation, the total energy 
of the system is given by 


N J j J j . . . e~ v]kT udx x dx^dx i . . .dxf 

I Iff"' e~* lhTd *xfc% dx 3 • • * ff 


(79) 
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The numerator Z may be transformed with the help of Euler’s 
We ha ve 

////'•• 

+ ////••■ g^‘ *■' ■■■ ix ’ 

+ • • • + ffff ••• e~ ulkT x f dx l dx i ...dx t . (80) 

In the first integral we integrate with respect to by parts: 

J e~ ulhT x l dx 1 = —kT [z 1 e-“"‘ T ] + kT J e— :kT dx^. (81) 

In general, the limits are — » and +oo — at any rate, there are two 
limi ts of equal magnitude in x^, but of opposite sign. Then, since 
u is proportional to Xj 2 , the bracketed expression vanishes when the 
limits are substituted, so that only the integral remains on the right 
side. Substituting this in (79), the first term yields kTo. If we proceed 
in this way with every term, we obtain 

2 Z = fkTo (82) 


_ NfkT __ 


= Nfu. 


(83) 


This is the celebrated Law of the Equipartition of Energy in classical 
statistical mechanics: Each degree of freedom which enters quadrati- 
cally into the expression for the energy of the entire system contributes, 
on the average, kT/2 to this energy. 

This theorem may be verified in a simple manner by applying it 
to the calculation of the specific heats of gases. Since 


— /d u \ 

’ “ \df)f 


the molecular heat becomes, according to (83), 


(84) 


( 85 ) 


Since R is approximately equal to 2 calories per degree, 

c t> 
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of the translational velocity; the specific heat per mol is then 


c„ 


whence 


and 



(87) 

(88) 


for any monatomic gas. These relations have been verified from the 
lowest to the highest attainable temperatures. 

As a model of a diatomic molecule we take a rigid dumb-bell.* In 
addition to translational energy, the molecule has rotational energy. 
The rotation, too, may be resolved into components about three axes. 
The rotation about the figure axis itself does not contribute to the 
kinetic energy, since the moment of inertia about this axis is zero; 
hence there are two additional degrees of freedom of rotation for a 
diatomic gas, and we have 


This relation is also generally valid at room temperature. Neverthe- 
less, it has been observed — especially for hydrogen — that the specific 
heat drops to the value for monatomic gases as the temperature is 
lowered. This does not agree with the Law of Equipartition. 

Molecules consisting of more than two atoms have three degrees 
of rotational freedom, so that as long as we consider such molecules 
rigid (internal vibrations absent), the specific heat at constant volume 
is given by 


612 c. 4 
Cv ~ 2’ e.~ 3’ 


(90) 


which holds for any such polyatomic molecule. 

As we saw on p. 575, the atoms of a solid body are bound to their 
equilibrium positions by qqasi-elastic restoring forces. Thus an 
atom possesses not only kinetic energy but also potential energy 
which, by the results of p. 577, is also a quadratic function of the posi- 
tion co-ordinates. This means that for an atom of a solid there are six 

* In reality the two atoms may also vibrate in the line joining them. The reason 
that these vibrations contribute nothing to the specifio heat at ordinary temperatures 
will be seen in the next chapter. 
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degrees of freedom contributing to the energy, whence 



This is the same as the empirical Dulong-Petit Law of Atomic Heats. 
At room temperature this law is in general well obeyed — only the 
lighter elements (e.g. diamond) show departures. At lower tempera- 
tures, however, the specific heats decrease — as a matter of fact, to 
zero — which is what the Nernst Heat Theorem requires. But this 
decrease is also contrary to the Law of Equipartition. Thus it seems 
as if this law is valid at higher temperatures, but that at lower tem- 
peratures an anlcylosis * of degrees of freedom takes place. This indi- 
cates that the classical statistics must be modified in some way. 

9. Fluctuations. Transition from Microscopic to Macroscopic Motion. 

If thermodynamical equilibrium, corresponding to the maximum 
value of the entropy, is of a statistical nature only, departures from 
the most probable values are to be expected when very small regions 
are observed. For example, if it were possible to observe an element 
of volume containing but one hundred molecules we would observe 
relatively large fluctuations of density in this space; if but one addi- 
tional molecule should enter, the density would be increased by 1 
per cent. These variations in density are responsible for the scattering 
of light in the atmosphere. A knowledge of the theory of this scatter- 
ing makes it possible to calculate Avogadro’s Number from the spectral 
energy distribution of skylight. Again, imagine a small, movable 
surface element in the wall of a vessel containing a gas. Let the element 
be so small that but a few hundred molecules strike it each second. 
While the element would not respond to each impact separately, the 
likelihood of considerable pressure fluctuations is no longer small, and 
an irregular fluttering of the element is to be expected. This pheno- 
menon was discovered about a century ago by the English botanist 
Brown, but its nature remained unexplained for a long time. If we 
observe very small, barely visible particles suspended in a liquid 
(colloid partioles), these exhibit an irregular vibratory motion because 
of the fact that the impacts of the molecules of the liquid do not exactly 
balance at each instant, so that the particle is urged sometimes in one 
direction, sometimes in another. Looking upon the colloid particle as 
a large molecule, its kinetic energy must, on the average, be equal to 
hT/2; according to the Law of Equipartition. What we observe in 
the microscope, however, are velocities of much lower order of magni- 
tude, if we take the velocity to be tie distance covered divided by the 

* Poincar6 borrowed this very apt term from pathology to denote the “ freezing 
np ” of a degree of freedom. 
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time. In reality, however, the velocity alters its direction so often 
that the observed motion is merely a smcothed-over version of the 
actual zigzag path. The path of such a particle is suggestive of a type 
of curve occurring in higher mathematics — a curve which is everywhere 
continuous but yet has no tangent. It is quite hopeless to attempt — 
say by cinematography — to ascertain the details of the motion in order 
to determine the actual mean kinetic energy. Bather, if we wish to 
give a theory of the phenomenon which is amenable to experimental 
test, we must consider the mean displacement instead of the velocity. 
Consider only displacements in one dimension. Let there be -N par- 
ticles per unit volume between x and x -j- dx at the time t = 0. After 
a given time r has elapsed, we consider a volume element of the same 
size at the point x'. Particles not within this volume element at first, 
but now inside it, have entered from the neighbouring elements. The 
probability that a particle enter from an adjoining element is, of 
course, a function of the distance x' — x, and the difference in 
time t of the two observations. Accordingly, we denote it by 
f T (x’ — x). The function f r includes also the case in which the 
particles were previously in the element a:' if we set x' — x — 0. 
Since the particles must come from some volume element, the 
density at x' at the time r is 

N(x', t) = J + N(x,0)fAx' -x)dx. . . . (92) 

Introducing the displacement X — x — x', 

N{x\ t) — />(*' + X, 0)}£{X)dX. . . (93) 

The function f r is even, since positive and negative displacements are 
equally probable: 

/,(I)=/,(-Z) (94) 

We now develop the left member in powers of t, the right in powers 
of X: 

= °) + * H+ i + — }/.m ix - ■ • < 96 > 

The derivatives dN/dt, dN/dx . . . have fixed values for the time t = 0 
and the point x'. On account of the symmetry of the function f T {X), 
the integral 

rxf,{x.)dx 



600 HEAT. STATISTICAL PART [Chap. 

vanishes; further, since each particle which enters must come from 
some other element, the integral 

fjr(^)dX = 1 . 


Dividing by r, we obtain from equation (95): 

3 N 1 d*N r+» 19 *N 

ST - Tr ^L. X ^ X)iX - S® x ■ 


(96) 


The integral on the right represents the mean square displacement 
X 2 , since it is the stun of the squares of the displacements, each being 
multiplied by the probability of its occurrence. Now it is readily 
shown that the mean square of the displacement is proportional to the 
time interval of observation, so that X 2 /2r is constant: if we observe 
the instantaneous positions of a particle at equal time intervals r which 
are not too short, we may look upon the/ individual displacements as 
independent, on account of the random nature of the impacts.* 
After a series of p observations the resulting displacement is 

X^ = X 1 + X 2 = X 3 + ...X*. . , . (97) 


The square of the displacement is then 

(AU»)2 = (Z x + X, + . . . X v f = XX 2 + 2SXX r X 8 . (98) 


If we now average over the displacements experienced by a large 
number of particles, the double sum will vanish on account . of. 
the equal probability of positive and negative displacements, and so 
we have 

(X®) 2 = pX\ (99) 

This means that after the time pr has elapsed we observe a mean 
square displacement of magnitude pX 2 , i.e. p times the value corre- 
sponding to the time r. Thus X 3 is proportional to the interval of 
observation. 

Equation (96) is neither more nor less than the well-known dif- 
ferential equation of diffusion, which is analogous to the equation of 
heat conduction, concentration taking the place of temperature. 
Thus, if the concentration varies from place to place, a stream of 
particles will flow from the points of high concentration to those of 

* If the observations are made too frequently, a given velocity of the particle will 
still persist after the instant of observation and so will affect the following observation. 
In a practical oase, however, it is hardly possible to make observations at such short 
intervals. 
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low concentration. We set the density of this flow proportional to 
the slope of the concentration, i.e. to — D grad N, or in the one- 
dimensional case, —D(dN/dx). The change in the number of particles 
within a volume element dxdy is due to the entry of particles from one 
side and the departure of particles from the other. At the point x a 
current 

.(-»£)* 


enters and at the point x + dx the current 


leaves. The excess of the number entering over the number leaving 
is the rate at which the particles are diffusing into the volume element, 


dN 


n 0 2 V 




dN 


-^dxdy^D^dxdy or W =D 


( 100 ) 


Thus (cf. (96)) D = ^ (101) 

represents the diffusion coefficient of the particles. If the particles 
are in a field of force, e.g. the earth’s gravitational field, the Boltz- 
mann equilibrium distribution sets in. This is given by 

N = N 0 e~“ lkT . (102) 

If the force is constant, the potential energy is given by Fx. We may 
imagine the equilibrium to result from the motion of the particles due 
to the action of the force together with a diffusion current which seeks 
to satisfy (102). The velocity of a particle which is in equilibrium under 
the action of the applied force and viscosity is given by Stokes’s Law 
(p. 222): 



where a is the radius of the particle and 17 is the coefficient of viscosity 
of the solvent. The density of this stream of particles is then 


__NF- 

6 irrja 


. . (104) 


But a diffusion current of equal strength flows in the opposite direc- 
tion, tending to keep the distribution of concentration constant. The 
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density of this flow is given by — D(dN fix). 
densities, 

Equating these two flow 

^dN NF 

(105) 

dx 6 irrja' 

On the other hand, it follows from (102), for i 

it — Fx, that 

13 N F 

N dx ~ kT 

...... (106) 

Inserting this expression in (105) and using (101), we obtain the im- 

portant equation 


D _ 1 
kT 6irr)a' 

(107) 

X* kT 

(108) 

r 3 irrja 


This is a general expression for the mean square of the displacement 
of a particle.* 

•The above theory was developed by Einstein, and the final formula (108) was 
experimentally checked by Perrin and others. Consult A. Einstein, Ann. der Physik, 
17, p. 649 (1905); E. Blooh, The Kinetic Theory of Oases, English translation by P. A. 
Smith (London, Methuen & Co., 1924). Einstein also developed a formula for the 
rotational Brownian movement experienced by particles of irregular shape. This 
formula, too, waa verified experimentally. This is discussed by Bloch, loc. cit. 



CHAPTER XXXVI 
The Classical Quantum Statistics 

1. The Quantum Condition for the Subdivision of the Phase Space of 
an Oscillator. 

As we saw in § 6 (p. 595) of the preceding chapter, the Law of 
Equipartition leads to conflict with experiment when applied to the 
calculation of specific heats. The results are entirely incorrect when 
the theorem is applied to a volume filled with radiation (see p. 622). 
In connexion with this problem, Planck was led to advance the hypo- 
thesis, in 1900, that a simple, linear, harmonic oscillator can absorb 
energy only in integral multiples of a minimum quantity of energy 
which is proportional to the frequency.* Thus the energy possessed 
by such a system is always an integral multiple of the quantity Jiv. 

u„ — nhv (1) 

The factor of proportionality, h, is the most important of all con- 
stants in atomic physics. It occurs universally, and is not connected 
merely with the linear oscillator, as might be inferred from the manner 
in which it was introduced. Its numerical value is 
h = 6-62 X 10~ 27 erg sec. 

This constant is usually referred to as the Planck quantum of action, 
since it has the dimensions of action (energy X time). We encountered 
this constant for the first time in the theory of discharges in gases 
(p. 432). The greater the natural frequency v, the greater are the^ 
intervals in energy content, and the greater is the difference between 
the quantum theory and the classical statistics, where a continuous 
series of energy states was tacitly assumed. 

What is the significance of the Planck hypothesis for the phase 
space of a system consisting of oscillators? It is obvious that for an 
•arbitrary division into cells there can no longer be equal probability 
for cells of equal size, for if we mark the points in which the oscillator 
has exactly the energy nhv, it will always be possible to find volume 
elements containing no points (i.e. elements whose probability is zero) 
if the subdivision is arbitrary. Nevertheless, it is possible to preserve 
* M. Planck, Ann . dtr Phyaiic , 4, p. 653 (1901). 
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the validity of the Liouville theorem by subdividing the phase space 
in a particular way. For the linear oscillator, which has but one posi- 


tion co-ordinate and one momentum component, the phase space is 
the* pj-pkne. The potential and kinetic energies are 


respectively. In an 
have the relation 


or 


energy state in which the total energy is nhv we 



( 2 ) 

(2') 


This is the equation of an ellipse whose semi axes are 
and 

This ellipse is termed the phase orbit, since the phase point traverses 
this path once during a complete vibration of the oscillator, as long 
as the energy remains constant. The area enclosed by the phase orbit 
is given by the phase integral (cf. p. 129): 

J — j) pdq — abrr — 2 rrnhv^j (3) 

Since vz= /-• 

2 7r V m 

this becomes 

J = nh. . (4) 


In the case of the oscillator, the Planck hypothesis leads to the condition 
that only those states occur for which the value of the phase integral is 
an integral multiple of h. This is the general formulation of the quantum 
condition. It is only by chance that for the oscillator this condition is 
equivalent to the energy condition; for other systems the two con- 
ditions are quite distinct. If we now draw the ellipses corresponding 
to n = 1, 2, 3 ... we obtain a division of the plane into elliptical 
rings, each of area h. If we ascribe to each of these rings the energy 
value corresponding to its inner boundary, the innermost ring has 
the energy 0, the next one 1 hv, &c. Thus we again arrive at a division 
of the phase space into elements of equal size; if all quantum 
states are looked upon as equally likely, each element has idle same 
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a priori probability and we can proceed as in the classical statistics. 

If the elementary particle has several co-ordinates, the condition 
that the phase integrals be integers holds for each co-ordinate, and 
the element of phase space becomes A*. Thus the value of the dimen- 
sional constant [A] introduced on p. 688 is determined. We write A 
for it in the future. 

The chief difference between the present procedure and the classi- 
cal statistics is that the prescribed division of phase spaceJnto cells 
of definite size no longer permits the replacement of the partition 
sum a by an integral, since the latter implies a continuous change of 
energy. We thus obtain for the partition sum of the oscillator (p. 587) 
a = A S e-^ kT = A (1 + «-*■»*• + e -2*W*T + e -s*w*r (5) 

The sum/of this geometrical progression is 



This expression for a has important applications to specific heats, as 
will be shown in the following section. 

2. Temperature Variation of the Vibrational Contribution to the Specific 
Heats. 

(a) Diatomic molecules 

The two atoms which constitute a diatomic molecule are not 
rigidly connected to one another, but execute -vibrations about their 
common centre of gravity in the direction of the axis of figure. The 
evidence for these vibrations is obtained from observations on band 
spectra (cf. p. 680). Why do not the two degrees of freedom of vibra- 
tional. kinetic and potential energy contribute to the specific heat, 
which apparently comes out correctly when only the translational 
and rotational degrees of freedom are taken into account? The quan- 
tum theory furnishes information on this point. With regard to the 
vibrational contribution to the total energy, wo can consider the 
diatomic gas to be a system of linear oscillators, since only vibrations 
along the axis of figure of each molecule are possible. We have already 
calculated the partition sum for an oscillator. The free energy of one 
mol of such oscillators is then (p. 688) 

/ = -kLT\og^-RThgh + RTlog(l-e-^ kT ) + RT\ogh, (7) 
whence, by differentiation. 


. ( 8 ) 
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The energy then becomes 


Rhve -M™ _ 


*(1 


Rhv 1 

T (ef b ‘ lkT - 


For high temperatures, the exponential function in the denominator 
may be developed in a power series ending with the second term, thus 
giving 

u = RT=L.2~ (10) 


'But this is the value of the energy corresponding to the two degrees 
of freedom according to the equipartition of energy. The classical 
statistics remains vaUd when the temperature is high-, at low tempera- 
tures the finite differences of the quantum states manifest themselves 
by the fact that their contributions are not as large as the classical 
value. The contributions vanish entirely for very small values of T, 
and this degree of freedom is “ ankylosed ” (p. 596). It is evidently 
a question of the ratio of the temperature to a temperature © which is 
characteristic of the oscillator: 



where we introduce the function 

< 12 ‘> 

For small values of x, i.e. for high temperatures, the value of <X> ap- 
proaches unity, while for large values of x, i.e. for low temperatures, 
vanishes. At room temperature (about 300° abs) the state of affairs 
is this: 


For * > 10, O(x) < 0-005; 0 > 3000°, 

i.e. v > 6-2 x 10 13 sec. -1 or A < 4-8 p. 


For x < 0-5, <D(x) > 0-975; 0 < 150°, 

i.e. r<3-lxl0“ sec." 1 or A > 95 /*. 

The accompanying wave-lengths are those of electromagnetic waves 
of the given frequencies; these are measured spectroscopically as 
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the infra-red'characteristic radiation of the system. Thus we see that 
an infra-red characteristic vibration whose wave-length is less than 
4*8 fj. is practically absent at room temperature. This is actually the 
case for diatomic gases. The function (X) steadily decreases from 1 to 0 
as x increases from 0 to oo . Hence, according to equation (12) there must 
be an increase of the specific heat as the temperature is raised. This is 
observed for HC1, in good agreement with the theory. If we insert 
in (12) the frequency corresponding to the infra-red characteristic 
vibration at A = 3*47 ft, there results 


1500° 

2000° 

2600° 


(b) The solid state 

If we represent the structure of a solid by atoms vibrating about 
centres of equilibrium, we must remember that there are three direc- 
tions of vibration; we used this fact in deriving the Law of Dulong 
and Petit. This means that one mol of a solid substance is equivalent 
to 3 L oscillators, and we obtain for the molecular heat of a solid, 
exactly as above, 

3m ($y ■ • <“> 

In this instance the characteristic temperature 0 is generally so low 
that all vibrations are fully stimulated at room temperature, and so 
the limiting value 3R holds for c,. Since the mass occurs in the de- 
nominator of the formula for v, it is only for the lightest elements that 
the characteristic frequency is so great that the full value of c. is not 
attained at room temperature. 

These Bimple considerations give a good qualitative representation 
of the variation of the specific heat of a solid, but cannot be connected 
with ip individual characteristic vibration. If we take several charac- 
teristic frequencies, we obtain so many free constants that the formula 
is merely of the nature of a good interpolation formula. It is evident 
that the assumption of a single frequency resulting from independent 
oscillators cannot lead to quantitatively correct results, for the oscil- 
lators are by no means independent. In fact, a disturbance applied 
to one lattice point is transmitted to the neighbouring .points, for we 
are dealing with a system of coupled oscillators. The calculation for 
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this case is rather troublesome, but has been carried out by Bom and 
Thirring for simple lattices.* 

Ex. 129. Determine, according to the conditions given above, the quantum 
levels of the rotational motion of a rigid dnmb-bell (diatomic molecule) of 
moment of inertia I. Investigate the falling off in the rotational part of the 
specific heat of B t . 


3. The Debye Theory of the Specific Heats of Solids. 


The law of variation of the specific heats of solids is obtained with 
much less calculation by a method due to Debye. In Debye’s theory 
the atomistic structure is thrust into the background and the coupled 
vibrations of the system of oscillators are considered to be elastic 
waves which traverse the body and are reflected at its boundaries, 
thus giving a system of standing waves. A wave corresponding to a 
given frequency has the same deter mining properties as a linear 
oscillator; firstly, a frequency v, and secondly, two energy components, 
corresponding to the potential and kinetic energy of the oscillator. 
For elastic waves there is a part corresponding to compression and 
one to motion; for electromagnetic waves we have the energy den- 
sities of the electric and magnetic fields- These properties completely 
determine a longitudinal wave, but for transverse waves the plane 
of polarization of the vibrations must be specified also. This may be 
determined by resolving the amplitude along two mutually perpen- 
dicular directions. Thus an unpolarized transverse wave corresponds 
to two oscillators, one for each component of the vibration. 

Following a method given by Rayleigh and Jeans, we now proceed 
to calculate the number of standing waves which can be contained in 
a body whose volume is F. For simplicity, let this be a cube of edge l. 

Imagine a wave travelling parallel to the s-axis. This wave will 
travel to and fro between the walls which are parallel to the yz-plane. 
A system of standing waves results from the superposition of the inci- 
dent and reflected waves. There is either a node pr a loop (antinode) 
at the wall, depending upon the nature of the waves and of the walls._ 
It is immaterial, in the following computation, whether one or the 
other condition exists. For electromagnetic waves and metallic re- 
flecting walls, for example, there must be a node of the vector E at 
the wall. In the problem of the specific heats we deal with elastic 
waves representing the vibrations of the lattice points. Only those 
waves can travel in the ar-direction for which l is an integral multiple 
of half the wave-length (cf. fig. 1). Calling the phase velocity c, we have 


. nX nc 

Tv 


(U) 


* Consult M. Bom, The Mechanics of the Atom, Eng. trans. and revision by J. 
Fisher and D. Hartree (London, Q. Boll & Sons. 1927). 
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For a cubical body, the same relationships are valid for waves 
in the y- and z-directions. But these are not the only posable 
standing waves. Consider, for example, a wave whose normal 
n — +y cos j8 + k cos y lies in the yz-plane. 

This wave will be reflected from the xy-plane, 
and we must consider the reflected wave as 

well as the incident wave. Now the super- fi (§ 

position of the two is, by p. 64, equivalent to f><t- i 

a wave moving parallel to the y-axis with the 

phase velocity c' = c/cos fi, whose amplitude in the direction of the 
z-axis is modulated with the wave-length A/ = A/cosy. The boundary 
condition for the planes parallel to the zz-plane thus demands that 

2 lv 2lv cos fi 

___ _ , 

should be a whole number, and the boundary condition for the faces 
parallel to the zy-plane requires similarly that 

21 2 lv cos y 

X t '~ c 

should be an integer. 

Consider now a wave whose normal direction 13 quite arbitrary: 
n — cos ai + cos fij + cosy&. The superposition of the incident wave 
and that reflected from the yz-plane yields 

Ji -)- A’ — ^4 0 e«"(*-(* c <» a +y co»/3+»co»y)/e)^_ J o ^a(t+ xcosa.lt- (yco*ff+*co*y)lc) 

— cos 008 <*“<*-<* “•/?+*«« r)/«). 

This wave, whose front is modulated in the direction of the z-axis, is now 
reflected at the zx-plane; superposition of the two waves then gives 

A + A’+ A" = 2A 0 cos “•>"> 

+ 2 A 0 COS giu(i +y cot fi/c—* tea y/c) 

- 4 A 0 COS COS J^t-icocy/c)' 

This wave front with lattice-like modulation, when combined with the 
wave reflected from the zy-plane, gives the system of standing waves 
^ + 4 '+ 4 " + A"' 
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The f ulfilm ent of the boundary conditions at the walls requires that 

2v2cosa 2vl cos B 2vl cosy 

c = n i> — ~ — ~ — ”«» — ^- 2 = »s. • ( 16 ) 


ciu „ cm, cn~ 

or C0Sr= d- 

Squaring and adding, 


Sl SI s 4 V 2 ! 2 

V + « 2 2 + »a* = 


(16') 
■ (17) 


To every frequency v there correspond triads of positive integers which 
satisfy (17), and to each triad there corresponds one direction which 
may serve as a wave normal. We now seek to determine the number 
of waves between the frequency limits v and v + dv. In order to 
compute this number we need only look upon Mj, n 2 , and » 3 as rect- 
angular co-ordinates. Then (17) represents the equation of a sphere 
of radius 2 vl/o. Since one integral lattice point corresponds to each 
cube of edge unity, the number of allowed vibrations in the range v to 
v + dv is evidently equal to the volume of one octant* of the spherical 
shell included between the radii 2vljc and 2(v + dv)l/e. This is 




4 7r . 4v a P . 2 dv 4irv 2 Z 3 

- 




(18) 


or, per cubic centimetre of the body, there will be 

dZ 4 nv 1 , na ,. 

■f =~jr dv ( 18 > 

vibrations between the limits v and v + dv, and this number is inde- 
pendent of the particular form of the body, which fact may be rigorously 
demonstrated. 

For the .elastic waves corresponding to the thermal agitation of 
the scorns of a solid we have (p. 181) longitudinal as well as transverse 
waves. These have different velocities, which we denote by c, and c t 
respectively. On account of their two azimuths of polarization, the 
transverse waves must be counted twice. Accordingly, we have for 
the total number of waves in the volume v of one mol, between the 
frequencies v and v + dv, 

dz^bnvvtQt+^jdv ( 19 ) 

According to this formula, the number of waves increases as the square 
of v and without limit, so that the number of oscillators becomes 
infinite, while in reality one mol of a solid contains only 3 L oscillators. 
Debye arbitrarily assumed that the total number of waves should be 
* Since only positive integral »< are allowed. 
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precisely 3 L. This assumption, which at first sight seems rather 
forced, is justified by its results. Moreover, it results quite automati- 
cally by the rigorous computation for a system of coupled oscillators 
(see end of the preceding section). The total number of waves is ob- 
tained by integrating over all frequency intervals from 0 to the 
limiting frequency v„. Thus v„ is determined by the equation 


C w C? + ?) ir ~ x C? + £) - sl - 


(20) 


If we treat each wave as an oscillator we can proceed exactly as 
§ 2 (p. 605). The intrinsic energy of a system of N oscillators 
frequency v is, by equation (9) (p. 606),* 


U 


Nhv 


Thus for dz oscillators we have an amount of energy 

. , /I , 2\ h^dv 9 L hifidv 

*’V + ^)fnsr=i- ? jnsrr-i- 

and the total energy of one mol is 

9 Lh r v t ifldv 


vLh f v t i *dv 

U ~'1JJ 0 e>"*T-r 


We introduce the following variables into the integrand: 

hv hv„ 0 


= hT k = 


= kT r 


( 21 ) 


(22) 


(23) 


Thus v 0 enters into the characteristic temperature in place of the 
characteristic frequency v of the preceding section. We then have 

where T(*) = ^jf"-^. (24) 


For small values of sc, i.e. high temperatures, development of 'the 
denominator in a power series yields 






( 25 ) 


* In equation (9) we were dealing with L — Rile oscillators. 


S.B' 
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If the series is terminated with the first term, differentiation again 
gives the specific heat in harmony with the Law of Dulong and Petit. 
The higher terms thus give the drop in the specific heat for smaDer 
temperatures. 

For the lower portion of the curve c„ =f(T), i.e. for large values of 
x, we can write « as the upper limit, since the high values of x con- 
tribute very little to the integral on account of the occurrence of e* 
in the denominator. The power series development of a?(e a — l) -1 
yields 

a?(e* — l)"* = fer*(\ — e~ a )~ l = aAr* + a^r 2 * -f aftr** + . . 
and term by term integration between 0 and co gives the series* 

The sum of the series 1,(1 jn*) is n*/90 — 1*0825. Finally, in the neigh- 
bourhood of the zero point, we have 

u = 58*45 ST. ...... (27) 

and for c, near T— 0, 

c. = 233-85 fi|p (28) 


This is the empirically well substantiated T 3 law for the specific heat 
of a solid. 

For farther details, consult P. Debye, Ann. der Physik, 89, p. 789 
(1912). 


•- - sm:* 



CHAPTER XXXVH 


The Theory of Thermal Radiation 

i. Analogy between a Gas and Radiation filling an Enclosure. Radia- 
tion Pressure. 

The experiences of everyday life show that a hot body radiates 
light, i.e. electromagnetic waves. The problem of calculating the total 
amount of radiation emitted by unit area of a surface per second, and 
of determining its spectral distribution, is thus as much a matter for 
the Theory of Heat as for Electromagnetism. 

First, we must ask whether the radiation of a hot body depends 
only upon its own properties and not also upon those of the environ- 
ment. At first sight it might appear that, in analogy with thermal 
conduction, the radiation sent to a cool neighbouring body is more 
intense than that sent to % warmer body. An important, advance in 
the solution of the problem formulated above was madeby Provost 
when he recognized that the rate of radiation is independent of the 
surroundings, and that the apparent dependence upon the environ- 
ment is due to the fact that the cooler body in turn sends energy back 
to the radiator, but at a smaller rate than does a hot body, so that the 
side of the radiator which is presented to the colder body cools off 
more rapidly. In this manner the number of variables upon which 
the radiation of a hot body depends is considerably reduced. Further, 
we shall find that the radiation from any body may be computed if 
we know that of a perfectly black body, and if we* know the absorp- 
tion spectrum of the given body. 

Finally, the problem of calculating the radiation from a perfectly 
black body 'is solved if we are able to compute the distribution of 
energy in a space bounded by walls which are impermeable to heat. 
In the-Theory of Radiation such a space is usually called a cavity or 
enclosure. Let the walls be at a uniform temperature T. Such a cavity 
filled with radiation (the whole space is continually being traversed 
by electroma gnetic waves radiated from the walls) has many points 
of similarity to a vessel filled with a gas. It is true that there is one 
point of difference: the density of a gas enclosed in a volume F may 
be changed at will, so that we have fie density (or pressure) as a free 
variable, in addition to the temperature. On the other hand,' the 
018 
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energy density of the confined radiation is already completely fixed 
by the temperature. But just as the atoms of a gas exert pressure on 
the walls, so likewise does the confined radiation. This radiation 
pressure, whose magnitude will be calculated below, is very small 
under conditions available in the laboratory, but may attain values, 
in the interior of the stars, comparable with the gas pressure. 

We know from experience that the spectrum of a glowing “ black ” 
body is continuous, i.e. all frequencies are present in the radiation. 
The problem of calculating the distribution of energy among the 
several frequencies is very similar to that of determining the distri- 
bution of the energy of a gas among the various molecular velocities. 
The superficial similarity between the experimentally determined 
energy distribution curve and the Maxwell distribution curve (fig. 1, 
p. 892) was recognized from the very first. No atom of a gas neces- 
sarily .possesses exactly any selected velocity; we can only specify 
the number of atoms whose velocities fall within a given velocity 
interval Ac. In the same way, there is not a given amount of radiant 
energy corresponding to a single frequency v, but only an energy 
interval A U corresponding to a prescribed rffnge Av, i.e. it is only 
the quotient A17/Av which is finite. To prevent misunderstandings, 
which occasionally appear in the literature, it must be pointed out 
that these considerations hold only for the continuous spectra result- 
ing from temperature radiation. The statement occasionally made that 
there can be no sharp spectral lines, i.e. finite quantity of energy at 
a single frequency v, on account of thermodynamic reasons, is erroneous. 
If the excitation is not thermal, there is no reason for believing that 
a sharp line cannot be emitted. The fact that even in this case the line 
will really have a finite (but small) width is due to other causes. 

Before attacking the actual problem of computing the distribution 
of energy in a radiation-filled enclosure, we shall deduce the value of 
the radiation pressure from the electrical field equations. We con- 
sider only the simplest case, that of a plane wave incident normally 
upon the plane surface of a perfect conductor. The ideal case a— oo 
implies a perfectly reflecting surface. An electric field cannot exist 
within a perfect conductor, for if a field did exist, it would cause in- 
finite currents to flow; we may suppose, however, for convenience of 
calculation that a is merely very great, not infinite, in which case 
there will be currents very near the surface of the conductor. Allow 
the incident wave to travel in the direction of the positive x-axis, 
and let the surface of the conductor be the yz-plane. The plane of 
vibration of E, which is still undetermined, is taken to be the xy-plane. 
The equation of the incident wave iB then 

E , « Ae" ( '~* e) . (1) 

The boundary condition concerning the continuity of the tangential 
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components of E requires that the electric field strength be always 
zero for a = 0. (For perpendicular incidence there exists no normal 
component of E whose discontinuity would give rise to a surface 
charge. There is such a component, however, for oblique incidence.) 
The boundary conditions are satisfied by adding a reflected wave of 
equal amplitude whose phase differs from that of the original wave 
by 77 , 

EJ = - Ae*°« +X,e) (2) 

The perfect reflection of the mirror is represented by the equality of 
| E v '\ and | E v \. Exactly as on p. 330, the corresponding magnetic 
vectors are determined from Maxwell’s equations: 

H H V =HJ =H„— H s ' = 0. (3) 

This indicates that the magnetio field does not vanish at the surface 
of the conductor, inasmuch as H t and H,' have the same sign. Since 
currents are flowing very close to the surface of the conductor, the 
magnetic field will exert a force on the conductor. By the results of 
p. 310, the force on a volume element dr traversed by a current whose 
density is i is given by 

dF-^d-r* (4) 

Now the vector i may be expressed in terms of curl H. Since E and 
dE/dt vanish within the conductor, the fust field equation yields 

i = curl H. ....... (5) 

Since only differs from zero, curlH has only a y-component, and 
we have 

0 dH, 

*• — (S) 

Substituting this in equation (4), there remains only an z-oomponent 
of the force: 

ir -—b a - ? £r i '- < 8 > 

Take as a volume element a cylinder of unit cross-section and altitude 
dx. To find the total force acting on this cylinder, we integrate from 
0 to oo : 

* We write dS for the force on the element instead of ?,u«. p. 310, t 
integrate over a finite volume later. 


(V 
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Since even for finite conductivity of the metal the wave penetrates 
but a very small distance, the magnetic field strength becomes zero for 
a; — oo, while, according to (3), it becomes | 2Ae iat | = 2 j H j for x — 0. 
This is the force acting on unit area of the surface; we may therefore 
call it the pressure of the radiation. Since for a plane wave in a vacuum 

| E | = | H { and |i|-£|E| | Hj, 
this pressure is, for normal incidence, 

P« = ^|B| |H| = ^ (8) 

In this formula I E |, | H [ and | S | signify the instantaneous values 
of the periodically varying quantities. What we observe, however, 
is the average in time, p^. This is obtained by inserting the time 
average of the Poynting vector on the right side of the equation. 
According to equation (60) (p. 341), this is equal to half the maximum 
amplitude, so that we have 

2 1 (8') 

Thus the pressure of radiation is proportional to the nonnalVy incident 
energy flux S. For oblique incidence we have, in addition, an electrical 
force acting on the surface charge. The final formula has cos a 0 as an 
additional factor, where 6 is the angle of incidence. 

The existence of such a pressure was first demonstrated experi- 
mentally and measured by L6bedew (1900), and in an extensive series 
of experiments conducted by Nichols and Hull (1903). The pressure 
found is in close agreement with the theoretical value. 

2. The Connexion between Energy Density and Surface Brightness. 

As already mentioned, the surface brightness is the radiation 
quantity which is experimentally measurable, while it is the energy 
density which is calculated by the theory. We therefore begin by 
finding the connexion between these two quantities. Consider an 
enclosure bounded by walls which are perfect heat insulators. Assume 
that bodies of any kind may be inserted into this space. Assume 
.further that the dimensions of these bodies are large compared with 
the wave-length of the radiation,.. so that diffraction and scattering 
need not be token into consideration. As a matter of fact, we wish to 
consider even differential segments large compared with the wave- 
length. Likewise, we wish to assume that the time intervals and their 
differentials are large compared with the period of the waves, so that 
the mean values of the radiation quantities are sufficiently defined. 
On account of the finite velocity of propagation of the electromagnetic 
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waves streaming through the space, there is a definite electromagnetic 
energy du in each element of volume. The limit du/dr is called the 
energy density p. This is spectrally analysed, so that the energy 
density p v dv corresponds to the frequency interval dv. The task of 
the theory is to determine p y for each interval as a function of the 
temperature of the enclosure. The surface brightness is defined as the 
amount of energy within the frequency interval dv radiated perpen- 
dicularly from an element of surface, per second per square centi- 
metre, within unit solid angle. This quantity is represented by 
2 Kjdv\ the factor 2 is present because the radiation can be resolved 
in two mutually perpendicular planes of polarization. Thus K v is the 
surface brightness for linearly polarized radiation. Since we are con- 
sidering isotropic bodies only, the surface brightness for unpolar- 
ized radiation is twice this quantity. ' The introduction of an angle 
of divergence (solid angle) of the pencil of rays is necessary on 
account of the fact that a source of light of finite surface bright- 
ness can produce no exactly parallel 
pencil. In order to obtain StiCh a 
pencil one must have a perfect point 
source at the focus of a lens. If the 
beam coming from the lens iB to have 
a finite intensity, the surface brightness 
of the point (i.e. extensionless) source 
must be infinite. If the normal to the surface which is emitting the 
radiation makes an angle 6 with the axis of the elementary cone, then — 
as may be seen from fig. 1 — the energy flux per square centimetre is 
reduced to cos 0 times the value for normal emergence, for at oblique 
emergence the surface concerned- in the radiation is increased by the 
factor l/eos0. This is known as Lambert’s Law. It is a consequence 
of this law, for example, that a glowing cylinder appears to be a uni- 
formly illuminated rectangle and the sun gives the impression of being 
a uniformly bright disc. The portions of the surface near the rim are 
foreshortened, but radiate correspondingly less in the oblique direc- 
tion. Lambert’s Law holds only for “black” thermal radiation, and 
is no longer valid, for example, for the X-radiation produced by the 
impact of electrons upon the anticathode; * in this case it is possible 
to obtain increased intensity by observing obliquely. 

Accordingly we have for the energy within the frequency range 
dv radiated in the time dt by the surface da. in a solid angle 
dQ. = sin 6d0d<f), 

whose axis forms an angle 6 with the normal to the surface 

d*u — d 6 u,dv = dvdtdo cos 6d£l , 2 K*. ... (9) 

As long as we consider only isotropic bodies, K r is independent of the 
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direction, and we obtain the radiation within dv emitted by da in the 
time dl by integration over the hemisphere: 

d*u — cPa„dv — 2dvdtdaK r f”'/ cos 0 si ndddd<j> 

— 2 trdtdoK y dv. . (10) 

For the total radiation we integrate over the entire spectrum: 

d?u— 2irdtda J K v dv (11) 

If, now, we consider a volume element dr within a medium which 
is being permeated by radiation, this element 
will be traversed by a largo number of 
pencils of rays originating outside it. Con- 
sider a definite portion of the neighbour- 
hood of dr bounded by a sphere of radius 
r whoso centre is at the centre of dr (fig. 2). 
Resolve the radiation sent to dr from a very 
small element of surface of this sphere into 
elementary conical pencils. The portions 
of these cones within dr may be considered 
to be small cylinders. If the cross-section 
of such a cylinder is dS, the solid angle dQ 
of the small cone is dS/r *, and the amount of energy radiated by da 
into this cone per second I 3 

^ ^2dvdaZ r < ^ ( 12 ) 

This may be looked upon as the part of the energy flux within the 
elementary cone originating at da. The density of the energy flow is 
then obtained by dividing by dS. Now for any flow, the current density 
of any quantity is equal to the product of the density of the quantity 
by the velocity of flow (see, for example, t>. 418). The latter is cjn, 
where n is the index of refractioq. of the medium. Thus the energy 
density in that part of the volume element pierced by the given cone 
which is due to the radiation from da is given by 

®P=d? P Jv= 2 ^ nKr . .... (13) 



This value holds for all parts ot the element of volume dr, since the 
other cones which penetrate dr difier infinitesimally in direction; 
hence d 3 p gives the energy density in dr due to the radiation from da. 
We then have by integration over all surface elements of the sphere 

dp = p„dv — dv. 


(14) 
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8 ; rK, 
~oJn‘ 
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(15) 


3. The Connexion between Emission and Absorption in Thermo- 
dynamic Equilibrium. Kirchhoff’s Law. 

Consider a uniform medium within the enclosure. At equilibrium, 
every element of volume of this medium will radiate a certain amount ' 
of energy each second, i.e. will convert a given amount of heat energy 
into radiation, and will absorb an equal amount of radiation delivered 
by the environment, converting it into heat. With regard to the pro- 
cess of emission and absorption, it must be remembered this takes 
place in the atoms of the substance, so that a certain volume is neces- 
sary for the emission of a given quantity of energy. A mathematical 
surface contains no atoms, and so can emit no stated amount of energy. 
It is true that we often speak of the emissive power of a surface; in 
such cases we mean the total energy which streams through the sur- 
face from the interior of the body. Hence we define the emissivity, 
or coefficient of emission e v dv as the energy of linearly polarized radia- 
tion within the frequency interval dv, referred to unit volume and 
unit solid angle', emitted in unit time. Then the total amount of un- 
polarized energy in the range dv emitted in the solid angle 4 w in the 
time dt is 

d a u t — ^TtdvdtdTSy. ...... (16) 


The absorption coefficient a„ for frequencies in the range v to v -J- dv 
is defined in the manner which is usual in optics as the fraction of the 
incident radiation absorbed per second in a thin layer, referred to unit 
thickness. Wo again surround the element qf volume by a sphere of 
radius r. The rays emanating from an element do of the sphere and 
penetrating the element of volume fill a solid angle which again may 
be subdivided into elementary conea; the portions of these cones 
within the volume element may be considered cylinders, as before. 
If these cylinders are of altitude di t and of cross-section dS s , then the 
total amount of energy absorbed in the time dt from the unpolarized 
pencil coming from do is given by 

d?u a — 2dtdv1. dl i a. r doK v , . . . . (17) 


Now 51dS t dl< is the volume of the element dr. Integration over all 
surface elements of the sphere yields the following expression for the 
total amount of energy absorbed from the environment: 

d°u a = Qrrdtdv draJL,, 


(18) 
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If we set the emitted energy equal to that absorbed, we obtain simply 

J (19) 

For any uniform medium in thermodynamic equilibrium, the falio of 
the emission coefficient to the absorption coefficient for any spectral region 
is equal to the surface brightness. This statement is often referred to as 
Kirchhoff’s Law, but the latter refers to two other quantities, especially 
in its experimental applications, as we shall see immediately. 

If the enclosure is filled with a medium which is perfectly trans- 
parent to all rays, e.g. a vacuum (or even air, as long as the system is 
at temperatures available in the laboratory), then while it is true that 
K r is defined by the energy passing through a surface, the fraction 
e,/a y is of the form 0/0. But the enclosure is bounded by walls of some 
kind, from which the radiation comes. We therefore investigate the 
radiation exchange at the boundary. The amount of energy passing 
through an area do of the wall within a solid angle dCl whose axis makes 
an angle 0 with the normal^ in the time dt, is 

= 2dvdtdod£l cos OKy. .... (20) 

Since there is thermodynamic equilibrium, the same amount of energy 
must pass through this same surface element in the opposite direction. 
Of this energy flow, only a part determined by the emissive power E, 
will come from the interior of the material constituting the wall, the 
rest being radiation which is reflected by the wall. The fraction of the 
incident radiation which is absorbed in the substance of the wall is 
called the absorptive power A v for the frequency range v to v -f dv. 
The part 1 — A, is reflected at the surface of the wall. The energy 
equation at this surface is then 

2dvdtdodQ cos OK, = (1 — A,)2dvdtdod£l cos OK, 

-f 2dvdtdodilcos8E„ . (21) 

whence K ^T ( 22 ) 

This is Kirchhoff’s Law. The ratio of the emissive power to the absorptive 
power of a body is equal to the surface brightness in the adjacent diather- 
manous medium. The difference between the concepts of emissive and 
absorptive power introduced here and the emission and absorption 
coefficients is that the former depend upon the thickness of the 
body. Even if the absorption coefficient is small, a sufficiently deep 
layer can destroy practically all the incident energy. A body for 
which A, = 1 is called a perfectly black body. For example, a sphere 
of gas — representing, say, a star — may be considered a black body on 
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account of the extremely great depth of material, although the absorp- 
tion coefficient of the gases is very small. . 

The fact that the emissive power of a black body is equal to the 
surface brightness in the adjacent medium makes possible the con- 
struction of a black body in a simple way. While black substances 
like platinum black absorb visible light completely, they are no longer 
“ black ” for longer waves, so that the use of such substances as black 
bodies may lead to deception. According to what has been said, a 
black body free from these objections is obtained by taking a large 
enclosure in the wall of which there is a small hole whose diameter 
must be so small compared with the size of the container that the 
radiation passing outward through the hole is negligible; its surface 
brightness is that which obtains in the diathermanous medium. The 
surface of this hole then represents that of a perfect black body. The 
effectiveness of this device may be seen by cutting a small hole in the 
side of a box made of black paper. The hole appears to be much 
blacker than the paper. In fact, the emissive power of a black body 
is experimentally determined by measuring the surface brightness of 
a small opening in the side of a uniformly heated furnace. 

According to EirchhofE’s Law the thermal emission of any other 
body is obtained, for any spectral region, by multiplying the black 
body value by the spectroscopically determined absorptive power. 
The emissivity of a non-black body must be less than that of a black 
body for all colours, and can attain the black body value only in the 
regions of strong selective absorption. In instances where the emis- 
sivity exceeds that of a black body, e.g. for a gas mantle, the excitation 
is not purely thermal. Here, for example, the gas in the region of com- 
bustion is by no means in thermal equilibrium, and the glow is due in 
part to chemiluminescence. The same is true of “ cold ” flames, e.g. 
the zone of reaction of a sodium vapour jet in chlorine, which gives 
the yellow sodium line with considerable intensity at temperatures 
a3 low as 600° C. 


4. Planck’s Law of Radiation. 

The relation between the emissive power of a black body and the 
energy density of radiation in an enclosure having been made clear 
in the last two sections, it remains only to calculate the energy density 
according to the methods of statistics. All the necessary computational 
work has already been done in Chap. XXXVI, §3 (p. 608). Consider the 
enclosure to be a cube of edge l. Since the electromagnetic waves 
traversing this space are pure transverse waves, the number of possible 
frequencies in the range v to v -f- dv is again given by 


dZ 


8rrV r 2 

~1F~ 


(28) 
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Each wave is statistically equivalent to an oscillator, and since, by 
equation (9) (p. 606), the energy of a system consisting of N oscillators 
of frequency v is 


V = 


Nhv 

iSwTr_l« 


( 24 ) 


the energy density within the interval dv, per cubic centimetre, is 
, SwAv 3 , /oe , 

P* Av = i) Av (28 > 

Hence for unpolarized radiation the emissive power of a black body, 
in the range dv, becomes 

2E v dv = 2K v dv = ^^— T) dv. . . . (26) 

This is the celebrated Planck Radiation Law. The curve E v {v) lfas 
the typical form of a Maxwell velocity distribution curve (cf. fig. 1, 
p. 592): first, a steep increase to a maximum, then an exponential 
decline toward higher frequencies. This exponential decay is charac- 
teristic of the Quantum Theory. The Rayleigh- Jeans Law, based on the 
equipartition of energy, requires the number of waves in a given 
interval to increase as v 2 , so that it would yield a continued increase 
of energy toward high frequencies, i.e. toward the ultra-violet, which 
is contrary to experience. 

If it is desired to express Planck’s Law in terms of the wave-length, 
it must be remembered that for a spectral interval expressed in dv 
and again in dX, 

Epdv = E x dX, 

and that I dv | = ^ | dX |. 

The radiation law (26) then becomes 

2E,dX = ?^. ? J r -~ 1 dX (-27) 

Fig. 3 is a graphical representation of the content of this equation. 
The Law of Planck contains all other laws of radiation. The Wien 
Displacement Law, which was derived from quite different con- 
siderations by W. Wien before the Planck law was known, relates to 
the position of the maximum of the energy distribution curve. It is 
especially simple when expressed in terms of A: differentiating the 
right member of (27) and setting the result equal to zero, we have 
10Ac 2 2 AcV* /An he 

A V“ /fcrA — 1) + A 5 (e Ac/ * rA - l) 2 ' IT A 2 ~ 
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Eliminating the common factors and introducing the variable 
x — hc/kTX, this becomes the transcendental equation 


We see at once that there must be a root in the neighbourhood of 5. 
Applying the usual methods of approximation, the exact value is 
found tcKbe 4-965. This is the only real root, as may be seen readily 
by drawing the curve. Hence we have 

A_ - 

4-965M” 

or £^ = 6 = 0-2896 cm. deg. . . (28) 


This is Wien’s Displacement Law, which states that as the temperature 
of a black body is increased, the position of maximum emission moves 
in the direction of shorter waves in such way that the product X^T 
remains constant-. 

The total radiation is obtained by integrating over all frequencies: 


2E 


2 f\dv = 

- / o 


2 h r Jdv 2i*T* r® x*dx 
<? J 0 ' ghrlkT _ l c2/,3 J 0 e * _ i‘ 


This integral has been evaluated on p. 612, where itB value was found 
to be w 4 / 15. Thus we have 


(29) 


This is the Stefan-Boltzmann Law, according to which the total energy 
emitted by a black body varies as the fourth power of the absolute tem- 
perature. Usually, however, we do not give the energy radiated nor- 
mally into a unit solid angle, but rather the total energy radiated on 
one side of a plane, i.e. into a solid angle 2w. The constant of pro- 
portionality o so defined then has the value no', according to equation 
(10) (p. 618),' so that 

a = ^3 “ 6 ’ 68 X 10-5 • 

1 ocrh 3 cm. 2 deg. 4 sec. 


Ex. 130. Using a cyclic process, deduce the Stefan-Boltzmann Law, leaving 
the factor of proportionality undetermined. The procedure resembles that by 
which the vapour pressure formula was obtained (on p. 534 et seq.). Show also 
that the pressure of radiation amounts to p = p/3. 



CHAPTER XXXVIII 

The Bose-Einstein ani> the Fermi -Dirac Statistics 


1. Preliminary Remarks concerning the Duality of Waves and Cor- 
puscles. 

In the preceding chapter we obtained the law of distribution of 
energy in an enclosure filled with, radiation by counting the number 
of electromagnetic waves which could exist there. Now there are 
numerous phenomena where radiation is ‘converted into energy which 
is connected with matter — kinetic energy of electrons, chemical energy, 
&c. In such instances light behaves as though it consisted of discrete 
corpuscular elements of energy, light quanta of energy hv. On the other 
hand, the corpuscular idea is not capable of including all properties , 
of radiation, for an immense number of interference phenomena 
demonstrate the wave-like character of light. In Part VII of this 
work we shall bring out how the wave concept and the particle con- 
cept are two aspects of the same thing, and how the facts may be 
represented only by a combination of < both ideas. We shall ’arrive at 
the conclusion that the light quanta are guided by the wave field, 
and that it is not possible to specify the positions of the quanta inside 
a wave. In view of the analogy between an enclosure filled with radia- 
tion and one filled with gas, it is to be expected that a corpuscular 
theory will also yield the radiation law correctly. When, however, 
we calculate the energy distribution of the quanta according to the 
Boltzmann statistics, we obtain the incorrect radiation law in which 
the energy is displaced into the short wave end of the spectrum. The 
underlying reason for the failure of this method is that we assume the 
light quanta to be independent of one another when applying the 
Boltzmann method. This independence certainly does not exist; if 
two coherent waves (bundles of quanta) interfere, this implies a mutual 
influence between the quanta, leading to a decrease in intensity at 
certain points and an increase at others. At any rate, a displacement 
of quanta occurs.* The correct radiation law is obtained by a new 
statistical method introduced by Bose. 


* It la well to remember that there is 
lerenoe phenomena. 




“ destruction ” at light in Inter- 
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Further, a number of recent experiments indicate that it is 
also incorrect to assume the independence of particles even when 
dealing with electrons and with entire atoms. There are guiding waves 
for these particles too, and these waves exhibit the properties of 
diffraction and interference. These ideas will be discussed in greater 
detail in Chap. XL (p. 687). At this point we wish to say only 
that the “ statistics of direct intuition ” — as we may call the Boltz- 
mann statistics — is not applicable. We may then build up a new 
form of statistics either in terms of waves or in terms of corpuscles. 
The wave form would have the advantage that its deductions are more 
illuminating, once we assume the existence of the guiding waves. 
Nevertheless, the results are often needed in the corpuscular form. 
It is preferable, therefore, to employ the statistics in terms of cor- 
puscles; this is the form deduced by Bose, by means of a happy arti- 
fice, for light quanta. For ordinary gases the difference between this 
and the classical statistics is so small that it has not yet been found 
possible to distinguish between the two experimentally. For an elec- 
tron gas, however, the situation is such that for densities of the order 
of that attributed to the electron gas inside metals the results accord- 
ing to the two views are totally, different. For the case of electrons, 
however, the Bose statistics must be supplemented so as to conform 
with a certain general principle of atomic structure. In this 
way we arrive at a new form of statistics due to Fermi — a form 
which is capable of solving many of the problems relating to electrons 
in metals. 

2. The Bose-Einstein Statistics. 

The essential characteristic of the statistics devised for light quanta 
by Bose * and applied to actual gases by Einstein f is that no dis- 
tinction is made between individual atoms, even in the micro-states. 
In the case of light quanta it seems artificial to distinguish between 
particles; on the other hand, the calculation of probability as carried 
out m classical statistics seemed reasonable and natural, because we 
are accustomed to the concept of individual atoms. The phenomena 
of the diffraction of atoms (p. 695) compel us to give up the idea of 
individually distinguishing the atoms or molecules of a given substance 
also. The way in which this alters the computation of probabilities 
will be illustrated by a very simple example. Suppose we wish to dis- 
tribute three atoms among three cells. A macro-state of classical 
statistics is determined by giving the number of atoms in the first, 
second and third cells. We may summarize the various distributions 
as follows: 

* 8. N. Bose, ZeiUch. f. Phy». 26, p. 178 (102*). 

t A. Einstein, Bari, Bar. p. 261 (1924). ibid. p. 18 (1926). 
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\ Group 

on \ 

I 

II 

HI 

1 

3 0 0 

2 0 12 10 

1 

2 

0 3 0 

1 2 2 0 0 1 

1 

3 

0 0 3 

0 10 12 2 

1 

Boltzmann Probability 

I 1 1 

3 3 3 3 3 3 

6 


As long as we look upon the atoms as distinguishable one from 
another, the distributions of the first group can be realized in but one 
way, the second in three ways, and the third in six ways — for a dis- 
■tribution in which Atom 1 is in Cell 1 and Atom 2 is in Cell 2 is taken 
to be different from a distribution in which Atom 2 is in Cell 1 and 
Atom 1 is in Cell 2. In Bose’s statistics no such distinction is made, 
so that each of these distributions has the same probability, and, 
according to Bose, a distribution of this kind represents only a mioio- 
state. What, then, is a macro-state in the Bose statistics? A macro- 
state may be represented by various micro-states, and the number of 
these gives its probability. It is natural to discard the distinction 
between the cells and to characterize the macro-state by specifying 
how many cells are occupied by 0, by l,.by 2, &o. atpms. Hence in 
the Bose statistics the macro-states are the groups bearing Roman 
numerals in the above tabulation. Thus State I has the probability 3, 
State II the probability 6, State III the probability 1. It is to be 
noticed, however, that we are permitted~to give up the distinction 
only between cells which are physically equivalent, e.g. those of equal 
energy. For instance, if we wish to compute the velocity distribution- 
of the molecules of an ideal gas according to the new method, the zone 
ox the phase space — to which a given interval of molecular energy 
corresponds — already contains so many cells that we may perform 
the above probability computation within it. The general formula 
for the probability of a macro-state defined in such a way that of a 
total of Z cells Z 0 are occupied by 0 particles, Z x by 1, . . . Z„ by w, 
is evidently obtained from the following scheme by permuting the 
numbered cells instead of the numbered particles as heretofore: 

Schematic Representation of a Micro- state in the 
Bose Statistics 

Cell number: 1 2 3 4 B 6 

Occupied by : 0 1 0 2 2 3 partioles 

Macro-state: Z 0 =2, Z 1 =l, Z 2 =2, Z 3 = 1, Z„=0 for n>3 
Should it happen that two cells are occupied by the same number 
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of particles, the interchange of their numbers will yield no new com- 
plexion, and hence the probability becomes 

Z! 


where £ Z„ is equal to Z, the total number of cells and £nZ„ is equal 
to N, the number of particles. 

Let us apply the method to the calculation of the entropy and 
distribution of velocities in ah ideal monatomic gas. Since each mole- 
cule has three position and three momentum co-ordinates, the size 
of each cell of phase space must be taken to be A 8 , as in classical statis- 
tics. Since the energy does not depend on the position, we can split 
up the phase space into the product of momentum space by position 
space, just as we did on p. 590 — i.e. we take the element of volume of 
momentum space, which alone concerns us in what follows, equal to 
h 3 

At, = (2) 

where V is the volume available to the molecules.* The energy of 
a molecule is given by 


Hence the value of the energy is constant over a sphere in the momen- 
tum space. If we divide this space into spherical shells of thickness 
A p, there will be a large number of elementary cells of size h 3 /V in 
each spherical shell; the Bose statistics can be applied to the distri- 
bution in the several shells, since the cells of each Bpherical layer 
possess equal energy. Distinguishing the shells by the subscript s, 
the probability of a state in which Z„, cells of the sth shell each con- 
tain n molecules is given by 

w,== nlj. w 

According to the Theory of Probability we obtain the complete prob- 
ability of any distribution by taking the product of the separate 


* Since At — AxAyAzApx&PyApt, where At ia the volurno element of phase space 
it may seem at first sight that we have to set At v - A*/A F, but since the Quantum 
Theory prescribes only the volume, not the form, of the element, we can extend the 
domain of the position co-ordinates, upon which the energy does not depend, at the 
expense of the range of the momentum co-ordinates. The deeper reason for being 
able to proceed in this manner is to be found in Heisenberg’s Principle of Uncertainty 
(of. p. 890), according to which the product of the possible variations of a position 
and a momentum co-ordinate is equal to k. Thus, if we give the position oo-ordinate a 
large range of variation, the momentum co-ordinate is, correspondingly, more sharply 
defined. Hence, since the energy does not depend on position in the present instance, 
we can take the entire volume as the rang} of the position co-ordinates. 
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probabilities. Assume the Boltzpiann equation to hold for this 
quantity: 

5=ilogPF == ilogll W, 

= JfeS logZJ - logZJ 

= k-ZZ,\ogZ,-k'ZZ,-k'LXZ na logZ„,-f JfcZSZ,,,, 
and since S Z n , — Z„ 

j S=AEZ,logZ,-*2SZ n .logZ tM . .... (5) 

For the condition of equilibrium, W must again be a maximum, with 
the auxiliary conditions 

XZ ns = Z, = const., ...... (6) 

for the total number of cells in each spherical layer is, of course, fixed 
by the geometry alone. Equation (6) then amounts to a number of 
auxiliary conditions equal to the number of shells. Further, we must have 
ZEnZ„, = N, (7) 

where N is the total number of molecules, and 

2Snu,Z B ,= £7, (8) 

where V is the total energy and u, is the energy of a molecule whose 
representative point lies in the sth cell. We again determine the ex- 
tremal value of the entropy under the given auxiliary conditions by 
applying Lagrange’s method of multipliers. In taking the variation 
of the number of particles in each cell it must be remembered that 
the total number of cells in each layer is fixed, i.e. the numerator of 
the fraction in equation (4) is a constant. Then, from (4) and (5), 


0=SS8Z B ,(logZ fl ,+ l) = 0 (9) 

The variation of the auxiliary conditions gives 

S8Z„,= 0 (10) 

for each shell; moreover, for a constant total number of particles and 
constant intrinsic energy, 

E2n8Z nl = 0, (11) 

and 22nw,SZ n ,= 8ET=0. (12) 


We now multiply each of the equations of (10) by a coefficient which 
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we take to be — (log a, + 1), and multiply (11) by /? and (12) by y. 
Adding these to the main equation (9) and setting the coefficients of 
SZ„, equal to zero yields 

k>gZ„ e — logc^j nf3 + nu,y = 0, 
or Z„, = a s e- B «>+* , *> (13) 

The multipliers a, may be determined by introducing the number of 
cells in each shell: 

. ( 14 ) 

The last form is the sum of the geometric progression. Then 

a,= 2,(1- «-»+!■*») (14') 

For the number of particles in each shell — which is the goal of our 
computation — we have 

N,=. Y,nZ n ,— a,Sne-”0>+r“*)r= -0,570— rSe—^+W 

« B * '» *9(/?4- yw 8 ) « 

d ( 1 \ a '€-»+*> 

a, 9()3 4- yw,)\l — g-^+rx.y {l - 1 1 


Inserting the value of a„ this becomes 

# _ Z > 

* c (0+v «,) — r 


(16) 


It remains only to determine the multipliers £ and y. Since y is multi- 
plied by the energy of the cells, the total energy is a function of y and, 
reciprocally, y is a function of the total energy U. Inserting (16) 
in the entropy equation and taking the derivative (95/317),= 1/T, 
we obtain, exactly as on p. 588-, the value 1/JcT for y* In order to 
determine /3 we must turn to equation (7), which gives the total number 
of particles. The summation or integration, however, involves com- 
putational difficulties, so that we shall leave ft undetermined. Setting 
e* 9 = 5, we have the distribution formula 


Z, 

~ Be^ kr - r 


For an ideal gas, the layers within which the energy may be con- 
sidered constant are the spherical shells included between the radii 
p, and p, 4- dp,. The equations connecting energy and momentum are 



* The corresponding multiplier was there denoted by A 



XXXVIII.] BOSE-E1NSTEIN AND FERMI-DIRAC 
Hence a shell contains 


dZ,~ 


47 rV 2m a u, du, 

WJf “ 


. ( 19 ) 


cells of volume k s /V, and. omitting the subscript s, the number of 
particles in the separate shells becomes 

_ 47 r V i \/‘2mhi du 


dN = 


p(j 5 e u/A*’_ f)- 


( 20 ) 


Finally, expresring u in terms of the velocity, we have the distribution 
formula 

(20') 


If B » 1, we obtain 


47 rVm 3 c?dc 
= W{Be.™'* hT - I)’ 


47t FwiVe ~ mc 'i zkT 


( 21 ) 


3 the classical statistics gave (p. 592) 


'vspY 


m / 


( 22 ) 


Thus when B is large, the Bose -Einstein statistics also leads to the 
Maxwell distribution. Equating the two expressions determines B: 


V(2nmkT)*l* 
Nh 3 


(23) 


We are justified in equating the two expressions for the reason that 
integration over all velocities must, yield the total number N of mole- 
cules in each case. The preceding equation shows that B is inversely 
proportional to N, so that our assumption of B~^> 1 has greater 
validity — and hence the Bose distribution agrees more nearly with 
the Maxwell form— when N is small, i.e. when the gas is at low pres- 
sure. Actually, deviations are first to be expected at those densities 
and those low temperatures at which all gases begin to exhibit strong 
van der Waals departures from the ideal gas law. Up to the present 
time it has not been found possible to separate the two kinds of devia- 
tions to the extent that one can decide with certainty whether the 
gas obeys the Buse statistics or that of Boltzmann. 


3. The Fermi-Dirac Statistics and its Application to Electrieal Con- 
ductivity. 

While the Bose statistics has not yet become of much practical 
importance on account of the difficulty of testing it experimentally it 
is, nevertheless, of great service in deriving a new form of statistics given 
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by Fermi and independently by Dirac.* This method appears to be 
the appropriate means of treating the electron gas in metals. In the 
theory of the structure of the atom it is shown that a given com- 
bination of quantum numbers "•an appear but once in each atom. 
This law, which is of fundamental importance, is known as Pauli’s 
Principle (cf. p. 679). Otherwise stated, this law says that a cell of 
phase space can be occupied by only one representative point. Fermi 
applied this principle to the totality of particles in a gas. Thus the 
numbers vanish for all values of n except 0 and 1. If the compu- 
tation is performed with this added hypothesis, all calculations for 
determining the most probable state remain unaltered. One must 
remember only that the summations for n are to be extended only 
from 0 to 1. The actual summations are not performed until the 
multipliers a, are determined. At this point the method departs from 
that of the Bose statistics. We have, at first, as for the Bose statistics, 
Z M = (24) 


To determine the a, we sum over all values of n; in this case the sum 
reduces to 

Z,^=a,(l + e-»+y.)) (25) 

The number of particles in the shell of subscript s becomes 


= l-Z u =a i c-W + ^ = 


3W&+T * ’ (26) 


As in § 2, (p. 630) y is found to be 1/kT, bo that 
Z, - 


N t = 


+■ r 


(27) 


This result has a remarkable similarity to the Bose distribution func- 
tion — the difference is only in the sign of the 1 in the denominator. 
This shows that the Fermi distribution, too, approaches the Maxwell- 
Boltzmann for large values of B, i.e. for low densities. Hence for 
ordinwy gases we can no more expect to test the validity of the Fermi 
statistics than we could that of the Bose method. But the situation 
is entirely different for an electron gas of the type we are .forced to 
assume exists within a metallic conductor (cf. p. 444 et seq.). If we 
assume that the number of conduction electrons is of the order of 
magnitude of the number of atoms, the density of the electron gas 
becomes so large that B is very small, especially since the small mass 
of the electron operates, according to equation (23), to affect the 
result in ihe same way. At ordinary temperatures, assuming the Fermi 
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statistics to be valid, the electron gas is completely degenerate, i.e. it 
obeys a totally different distribution law from that for an ideal gas. 

We now wish to use the Fermi statistics to derive the most impor- 
tant property of the electron gas. First, we must double the number 
of cells belonging to a given energy range, as compared with the number 
for.an arbitrary gas. The reason is this: it will be shown in Part VII 
that it is necessary to ascribe to the electron a magnetic moment of 
one Bohr magneton. According to the rules of the Quantum Theory, 
the axis of this moment can set itself either in the same or the opposite 
Bense to a given field direction. For each of these positions we have the 
number deduced above for the number of cells of the translational 
momentum space. If we now pass to a very weak field, the magnetic 
contribution to the energy vanishes and the two orientations become 



equal in energy, i.e. the existence of the magnetic moment of the 
electron doubles the number of cells in each shell. We thus have 




(28) 


Inserting this value in equation (27) and omitting the subscript s, we 
obtain the nujnber of electrons whose energy lies between u and u + du: 


dN 


8Tr7V2m» u 1 '* , 

— a» — -KWT: i 


. . . (29) 


We must now seek to express the constant P in terms of N by in- 
tegrating over all values of the energy. The result is 


„ 8nVVW r uWdu 
p J 0 BeUlkr+ r 


. . . (30) 


Unfortunately the integral eannot be evaluated in closed form. We 
therefore avail ourselves of a somewhat crude approximation: if we 
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draw the curve of the integrand function it is seen to "be a parabola as 
long as Bef‘ lkT can be neglected in comparison with 1 (cf. fig. 1), but as 
■soon as the exponential becomes comparable with unity, the actual curve 
descends and finally approaches the axis of « asymptotically. If we 
erect a vertical line through the point w 0 , which is determined by 
J5c"*/* r = 1 oi «„ = ~1cT logJ3, 

it is evident that the two shaded areas are approximately equal. We 
may therefore replace our integral approximately by the area under 
the parabola between 0 and w 0 . In order to avoid difficulties with the 
sign, introduce the constant If Am. place of B, i.e. 

~logS=+log^4 (31) 

Then the area becomes 

f , » f (feT logA) Sli . . (32) 


Inserting this in (30) and. solving for log A, we obtain 

. . a* / 3 n yi 8 

° gA ~ 2mkT \8irv) ’ * * ' 

and hence the distribution function becomes 
8nVV2m* u 1/2 du 

‘ gUtkT- (A'/SmAT) (3NiS«V)W j‘ 


dN = 


. (33) 

• (34) 


The result becomes especially vivid if, instead of giving the number 
of electrons whose energy is between u and u + du and -which are 
in the volume V, we specify their density in the momentum space. 
Since the shells have’ each a volume 

dr = 4 np 2 dj> — ‘brVSrnht. du, 
the density of points in the momentum space becomes 
_dN 1 

9 (fa. 00 eUlkT-(hVZmkT)(2Nia*V)™ ^ L •••(») 

For energy values less than 

a» / ?,n y 

2 m \8» tV) 

the denominator has the value 1 at T — 0; for energies greater than 
this value, it is infinite. Thus at T = 0 the density of the representative 
points as a function of the corresponding energy is constant up to a 
certain point and then drops suddenly to zero. This result can be 
understood on the basis of the Pauli Principle. Since each cell may 
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contain but one electron, all energy levels must be occupied, from the 
lowest upward, until all electrons are provided for. Raising the tem- 
perature then has merely the effect of rounding the edges, as one may 
verify by inserting numerical values of T which are greater than zero. 
In order to compute the energy of the electron gas we must evaluate 

‘ 0~{uJN. (36) 

In the same way as before we obtain as a first approximation 

8 ■nWfri? r 8 r kT]o ln, 

V = u / i <=» 75 / u 3 Z du 

" ± e w*r i i h? Jo 

A 

.HkTlogA)** (37) 


Inserting the value of log A, we obtain, finally, 

rT 167rFV'2^»A 5 fc 5,2 r 6/2 / 3 N Y ,s _ 4 ttFA 2 / 3 N \ 5,s 

5A 3 (2m) s/2 A 6/2 r 5/2 ‘ \8 itF / 5 m \%ttV) ' ( ) 

Thus to a first approximation the energy content of the electron gas 
is independent of the temperature; i.e. the energy found above is 
already present at the absolute zero. Rut since the specific heat is 
obtained by differentiating the energy with respect to T, it follows 
that the specific heat of the electron gas vanishes, to a first approxi- 
mation. Thus the greatest obstacle in the electron theory of metallic 
conduction is eliminated, for the classical statistics leads to an increase 
of the specific heat of the solid metal from 612/2 to 912/2 per mol, 
assuming the same density of electrons as here. This increase is not 
observed. On the other hand, the Fermi -Dirac statistics leads to the 
observed absence of a contribution to the specific heat. It is a matter 
for a more complete calculation to compute the electrical and thermal 
conductivities of metals based on the new distribution. This has been 
done by Sommerfeld and his co-workers in a series of papers, but the 
details cannot be given here.* A further advance was made by taking 
into account the interaction with the lattice ions by the introduction 
of a spatially periodic potential. This computation must be based on 
quantum mechanics and so it is described later (Chap. XLI, p. 707 et seq.). 

In this way we arrive at a serniquantitative understanding of aU eleotrical 
and optical properties of metals except one, whose explanation is lacking, viz. 
snperconduct.imty. This consists in the fact that at a few degrees above absolute 
zero the metals of columns 116, UTS, IVa, IV6, Vo and Via of the periodic table 
(p. 676) as well as several alloys suddenly lose all electrical resistance. The tran- 
sition temperature is a characteristic of the material. Connected with the dis- 
appearance of resistivity is the disappearance of the magnetio permeability. 

• Consult A. Sommerfeld, ZeiUxhr.f. Phyt., 47 1-2, pp. 1, 43 (1928). 
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4. Entropy Constant and Ghemical Constant of an Ideal Monatomic das. 

The thermodynamics of the vapour pressure curve and of chemical 
equilibrium peremptorily demands that the absolute value of the 
entropy should be determined (cf. p. 553). Since the Quantum Theory 
leads to a determination of the size of the element of phase space, it 
is to be expected that the absolute value of the entropy may be cal- 
culated readily, for the size of the element enters into this quantity 
(cf. p. 563). However, a difficulty exists in respect that the classical 
quantum statistics yields a quantum condition only for periodic 
motions. Now the translational motion of the atoms of an ideal gas 
(we restrict the present discussion to ideal'monatomic gases) is entirely 
non-periodic ; if we wish to apply the Quantum Theory we must arti- 
ficially introduce a periodicity, say by treating the to and fro motion 
between the walls as a periodic motion. Moreover, in order to arrive 
at a reasonable result with such considerations, we find ourselves facing 
an inconsistency with the Boltzmann statistics; here we do not dif- 
ferentiate between two states resulting from the interchange of two 
atoms, while in Boltzmann’s method two such complexions are 
taken to constitute two different micro-states. For this reason we 
refrained from deriving the chemical constant on the basis of the 
classical quantum statistics (Boltzmann statistics augmented by the 
quantum condition), since this actually uses the principal feature of 
both the Bose and Fermi methods, viz. the absence of any distinction 
between like atoms, which obtains even in the micro-state. The ab- 
solute value of the entropy follows in a simple manner from the postu- 
lates of the new statistics. The Bose and the Fermi statistics both 
lead to the same result, since according to equation (23) (p. 631), the 
constant B for ordinary gases is large compared with unity, and so 
the “ one ” in the denominator of each distribution function may 
be dropped. This makes the two forms identical. 

In order to derive the entropy constant we start from the formulae 
of § 2 (p. 629): 

S = &EZ, logZ, — h'L'S,Z n , logZ M , . . . (6) 


with the auxiliary conditions 

SZ n ,= Z„ . . . 
SSnZ„^N = SN„ 
S2m*.Z n ,= XN,u t . 


( 6 ) 

( 7 ) 

(8) 
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There we found the most probable distribution to be 

Zn. = a l/ e—‘V+u t lkT) > ( 13 ) 

Z, = o,(l - (14) 

N. = Z t {eP + ^ kT - l)- 1 (16) 

Under the assumption that i.e. 1/B = e~ f <C 1, we obtain 

log Z, = log a, + e~ p ~ u ' lkT = log a, + g e~^ kT , . (39) 

(40) 

If we insert the values from (39) and (13) in the logarithms of equation 
(5), we obtain 

S = kXZ. log a, + AS I* e-V-r — AS S Z n , log a, + AS S Z n , np 

+ (41) 

According to equation (6), the .first and third terms drop out. By 
equation (40), the second term is equivalent to ASIV, — AZV; by (7), 
the fourth term is kfiN, and according to (8) the last term is U/T. 
Thus the equation simplifies to 

S=kN + kfiN + % (42) 

Taking one mol, i.e. L atoms, and putting u — ZRT/2, we obtain 

s=iJ + j8i2+|J?=j82J+ffi. . . . (43) 

Using the expression for ft — log B given by equation (23) (p. 631) 
we obtain s as a function of v and T: 

l = logt; + |logT + f + log ( -^^. . . (44) 

In the definition of the chemical constant, s is assumed to be a function 
of p and T. Performing the conversion with the help of the equation 
of state we have, since R = kL, 

- = f log T - logp + § + log = | bg io gp + (46)' 

and since c„ = 5.B/2, the chemical constant of an ideal monatomic 



(2ttw) 3/2 A; 5/ * 

A 3 


• • ( 46 ) 
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Inserting the numerical values of the atomic quantities and intro- 
ducing the logarithms of the factors needed for the conversion to 
practical units (p in atm.), we obtain the formula quoted on p. 538 : 

i = — 1*59 + f log 4 (47) 

Ex. 131. Thermal Ionization : Calculate the chemical constant of the electron 
gas in the non-degenerate (rarefied) state and use the result to compute the 
thermal equilibrium between atoms, ions and electrons in a hot gas in terms of 
the work of ionization of the atom. 



PART vn 

The Structure of Atoms and Molecules 
and the Theory of Spectra 



CHAPTER XXXIX 


The Mechanics of Simple Atomic Models 

1. Investigation of the Structure of the Atom; the Scattering of 
a-Particles. 

The many phenomena in which electrons are released from neutral 
atoms or atomic aggregates — e.g. the photoelectric effect and the 
phenomena of electrical discharge in gases— suggested from the very 
first that the atom is a structure made up of electrons, the charge of 
which is compensated by an equal number of positive elementary 
charges. The question immediately arises of how the two kinds of 
charge are arranged within the atom, and where the inertial mass is 
located. This question was attacked experimentally by probing or 
sounding the atom by means of projectiles of subatomic size. These 
were sent through the atoms, and the arrangement of the charges was 
deduced from the alterations in the paths of the particles. Lenard, 
who Jfirst undertook experiments of this kind, used electrons as pro- 
jectiles. He recognized the fact that the volume occupied by an atom 
in gas-kinetic collisions or in liquids and solidB is merely the effect of 
electrical fields of force due to charges within the atom, while the 
intrinsic volume of the parts of the atom is vanishingly small. Ruther- 
ford and his associates obtained particularly simple conditions by 
using a-particles expelled from radioactive substances instead of 
electrons. The former carry a positive charge of two elementary 
units and, like the electrons, are of exceedingly small size, but have a 
mass almost 7400 times as great — the same as that of a helium atom. 
For this reason, the deflection of the path of an a-particle requires a 
centre having a large inertial mass, inasmuch as the conservation of 
linear momentum requires that the centre of gravity of the two bodies 
should continue in its original motion. For a collision between an 
a-particle and an electron, the centre of gravity is practically coinci- 
dent with the former, so that an .electron is not capable of appreciably 
deflecting an a-particle. Hence the use of a-particles in this way is 
able to furnish information concerning the location of the mass of the 
atom, so far as it is connected with electric charges. C. T. R. Wilson 
found * that if ions are present in supersaturated dust-free air they 

• 0. T. R. Wilson, Phil . Tram ., A, 189, p. 266 (1887). 
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act as centres of condensation for water droplets. This is the principle 
of Wilson’s so-called “ cloud chamber If the ions are produced by 
the passage of a-particles through the air, the path of the particle is 
outlined as a “ fog track Photographs of such tracks show that a 
deflection through a large angle occurs suddenly, rather than by the 
cumulative effect of many smaller deflections. This further simplifies 
the theory of the effect to the extent that we may limit our calculations 
to simple scattering and omit from consideration the rare case of a 
single particle experiencing multiple deflections. 

Consider now what happens to a single particle. We shall ascribe 
to the scattering centre a mass so great that we may consider it at 
rest during the scattering process. Since the central mass can in no 
instance be greater than that of the atom, this holds only for scattering 
by atoms of heavy elements. Actually, the first experiments were 
made with such elements. Assume the scattering centre to have Z 
elementary charges. Then the force acting on the o-particle is 
2Ze a /r a . Since this law of force is the same as that of gravitation, so 
far as the dependence on r is concerned, we can apply the results of 
p. 91 at once. The path is thus a conic section with the scattering 
centre at one focus. This leaves the question of the sign of the central 
charge open. The hyperbolic orbits in either case differ only to the 
extent that the central body occupies the focus on the concave side 
when the force is attractive, and is at the focal point on the convex 
side when the force is one of repulsion. In harmony with other experi- 
ments, in which the positive charge is always associated with masses 
which are integral multiples of that of a hydrogen atom, we assume 
the central charge to be positive. Comparing the potential energy of 
gravitation — yrnMjr with the electrostatic energy -f2 ZeP/r, we see 
that yM must be replaced by —2 Ze 2 /m in the formulae of p. 90, where 
m is the mass of the moving a-particle. In order to avoid confusing it 
with the mass of the electron, we give it the subscript a in what fol- 
lows. According to equation (21) (p. 91), the equation of the orbit 
becomes «, . 


/ 2 . 4Ze 2 , ZW 

1 + zW l ’° + ^ 0 + ^v cos ^ 


where c is the areal velocity. The negative sign signifies that if the 
positive value of the radical is taken, the branch of the hyperbola 
directed away from the centre is the one described by the particle. 
If, for example, we take <f> — 0, the corresponding value of r in fig. 1 
is not to be drawn to the right, but to the left. At the beginning of 
the motion the particle is at an infinite distance, i.e. 1/r is zero, and 
its velocity is v 0 . If the initial direction, which is represented by the 
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asymptote of the hyperbola, is a distance p from the centre, the areal 
velocity at the beginning is v 0 pj2, since the curve and the asymptote 
are practically coincident at great distances from the centre. This 
areal velocity is represented by the shaded triangle in fig. 1. Because 



of the constancy of the areal velocity we may put this value in 
(1) in place of c, thus obtaining 

_ 

2e 2 Z 


1 + 


v 0 pm, 
2Ze z 


‘V W o a 


m a \y 


The numerical eccentricity becomes 


_ v 0 pm u I 2 , 4 Z z e* 

~ ?Ze z M V ° T Vp 3 m/ 


( 2 ) 


( 3 ) 


Denote the acute angle between the initial direction of the particle 
and the positive x-axis by y. Then the angle of deviation, i.e. the angle 
between the asymptote and the one corresponding to the final direc- 
tion of motion, is giveD by 

l~l-r W 


From the geometry of the hyperbola, 
1 


cos (jt — y) - 


coty = tan^=- 7 =_. 


* It follows from the polar equation r — — p/(l •+ 
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so that half the angle of deviation is given by 



This simple formula gives the angle of deviation as a function of the 
parameter p; it is impossible, however, to test the validity of the 
equation experimentally, since we cannot directly determine the dis- 
tance of the initial direction of a given particle from an atomic nucleus. 
The result of the computation can be tested only by statistical methods, 
in which the theory of probability is applied to the distribution of the 
various parameters in order to determine that of the angle of de- 
flection. In fact, this marks a weakness in logic of the entire atomic 
mechanics, as based on models: the quantities necessary to determine 
the course of a single process cannot be determined exactly;* then, 
if we pass to a statistical distribution, the quantity p, which cannot be 
observed, again drops out of the calculations. Thus the elementary 
process is used only as an intermediate concept. The direct procedure 
would be to assume that a pencil of a-rays passes into the field of 
atomic nuclei and to inquire into the distribution of the rays after 
passing through this region. This is, in fact, the way in which such 
problems are attacked by the newer quantum mechanics to be dis- 
cussed in the following chapter; there an attempt is made to operate 
only with directly observable quantities. For the moment, however, 
we shall continue to discuss the atomic models, and shall now compute 
the angular distribution of scattered particles by means of such con- 
cepts. 

If the layer of atoms which is penetrated has a thickness s, and if 
there are N atomic centres per unit volume, then a hollow cylinder of 
radii p and p + Ap whose axis is in the direction of the incident rays 
contains, on the average, 

AN = 2irpsNAp (6) 

nuclei. For the small values of p which are considered, this is a very 
small number, e.g.- 0-001. The introduction of an average value is 
justified by the fact that the primary pencil has a finite cross-section, 
so that the layer is penetrated at various points, which is equivalent 
to using an infinitely narrow pencil of rays and moving the bom- 
barded layer of atoms to and fro within che same limits of variability. 
Assume that each particle is scattered once ; then, of n primary par- 
ticles, a number 

An = n . 2-trpsNAp (7) 

* Later we shall find that these quantities are inherently indeterminable, quite 
apart from the insufficiency of our methods of attack. Cf. p. 690. 
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will suffer “ impacts ” with a nucleus lying between p and p + A p. 
Now by (5), p is connected with the angle of scattering, 0, by the 
formula 

P = 


A p — 


so that the fractional number A n/n of a-particles scattered between the 
angles 0 and 0 -f- A0 is 

. insNftZ* cosf 

(9) 

Formula (9) becomes invalid when 0 = 0, for then it yields A njn = oo . 
This would mean that practically all a-particles go through without 
being scattered. The discrepancy is explained by the fact that we are 
not permitted to take an infinitely large upper limit for p but must 
use the finite value given by the mean distance apart of the atoms of 
the scattering material. 


2e*Z . 9 } 
v 0 2 m a C ° 2 I 

e 2 ZA0 [ 
Vw» a sin 2 0/2 j 


2. Conclusions from the Experiments on the Scattering of a-Particles. 

The scattering formula (9) is well confirmed for the heavier elements 
as scattering centres and for finite angles of deflection. The experi- 
mental verification of the scattering formula justifies, for one thin g , 
our hypothesis of electrostatic forces between the particles and the 
nuclei and confirms the assumption of simple scattering. Further, a 
comparison of the scattering powers of various substances shows that 
N may be set equal to the number of atoms per unit volume and that 
Z coincides with the place of the element in the periodic system. We 
shall discuss this important result in greater detail below. 

Deflections of 90° are sometimes observed. If we use equation (9) 
to determine the value of p corresponding to a-particle velocities of 
10* cm. /sec. and to a deflection angle of 90°, we obtain a value of the 
order of 10 -12 cm. This defines the radius of the sphere within which 
the inertial mass of the atom is concentrated, and the Coulomb law of 
force is still valid at these distances. We shall refer to this central 
structure as the nucleus of the atom. Its size is vanishingly small 
compared with the gas-kinetic radius of the atom, which is of the 
order of magnitude of 10 -8 cm. This extension in space must therefor© 
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be maintained by the electrons surrounding the nucleus; their elec- 
trical fields of force prevent the closer approach of other atoms. 

However, the most important result of the scauiering experiments 
is the recognition of the fact that the nuclear charge Z consists of a 
number of positive dementary charges equal to the position number of 
the element in the periodic table. Formerly, the elements were arranged 
in the periodic system in the order of atomic weight. At certain places 
this sequence had to be violated in order to satisfy the sequential 
arrangement according to certain chemical properties. We now re- 
cognize that the nuclear charge is the fundamental parameter, and 
this makes the order correct at the places mentioned. Since the atom 
as a whole is normally electrically neutral, the number of electrons 
outside the nucleus must also be equal to Z. The position number Z 
of the element was called the atomic number by Moseley (of. p. 666). 
We thus have the important relation: 

Nuclear charge = Number of electrons outside the nucleus 
= Atomic number of the dement. 

Hydrogen, of atomic number 1, has a nucleus with one elementary 
charge whose mass is that of a hydrogen atom (less the mass of the. 
electron). M a is 1-66 X 10~ 24 gm. The hydrogen nucleus is to be 
looked upon as the atom of positive electricity, which differs from its 
counterpart — the electron — in having a much larger mass. The 
nucleus of the hydrogen atom is called a proton. Further, since helium 
has the atomic number 2, and since the a-particle has a resultant charge 
of +2, and a mass equal to that of the helium atom, we recognize the 
a-particle to be a helium nucleus. One might conclude that the nuclei 
of the heavier elements consist of protons only, but there are two 
serious objections to this idea: since the mass of each proton is that 
of a hydrogen atom (the mass of the electron may be neglected in 
comparison), the atomic number would have to be equal to the atomic 
weight, which is obviously not the case. Prior to the discovery of the 
neutron (p. 750) it was assumed that electrons were present in the 
nucleus itself, thus serving to reduce its net charge. Supporting this 
view was the fact that in many radioactive changes — which are cer- 
tainly nuclear processes — electrons are emitted in the form of j3-rays. 
However, the assumption that electrons exist in the nucleus led to 
such great difficulties that it had to be abandoned, and now we explain 
the difference between atomic weight and nuclear charge in terms of 
neutrons present in the nucleus. 

The second objection — that if the elements are built up of protons 
and electrons all atomic weights, referred to hydrogen, must be whole 
numbers (which is evidently not so) — was removed by the discovery 
pf isotopes. As shown especially by Aston in his analysis of canal rays 
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with the mass spectrograph (p. 441), the chemical elements are largely 
mixtures of atoms of different atomic weights, but having the same 
nuclear charge and hence the same number of outer electrons. Apart 
from small deviations, the several atomic weights are integral multiples 
not of that of hydrogen itself, but of a weight about 8 per cent smaller, 
i.c. 1 /16 the atomic weight of oxygen. The small departures from the 
integral multiples are to be interpreted on the basis of the principle of 
the inertia of energy as differences in'the energy of formation of the 
nuclei. The large value of these energy changes (obtained by multi- 
plying the difference in mass by the square of the velocity of light) 
corresponds with the order of magnitude of the energy released in 
radioactive processes. 

3. The Bohr Model of the Hydrogen Atom. 

Now that we have before us a general rough picture of the atom 
in the form of a heavy nucleus about which electrons cluster, we 
naturally desire more definite information concerning the arrange- 
ment of these electrons. The first question which arises pertains to 
the nature of the force which balances the electrostatic attraction of 
the nucleus, thus preventing the electrons from falling into it. Similar 
situations in celestial mechanics suggest that the opposing force is 
.centrifugal force. We are thus led to assume that the electrons execute 
rapid motions of revolution about the nucleus. This idea was followed 
out mathematically by Bohr,* first for the simplest case of the hydro- 
gen atom, and with the simplest form of electron orbit— the circle. 
There is one great difficulty. The revolving electron, together with 
the positive nucleus, represents a miniature antenna in which two 
vibrations which differ in phase by tt/ 2 are taking place. According 
to the laws of electrodynamics, an oscillator of this kind must radiate 
energy in the form of electromagnetic waves. Such an energy loss 
can occur only at the expense of the potential energy, i.e. the orbits 
must decrease in size, so that the electrons traverse spiral orbits which 
ultimately lead into the nucleus. This cataclysm must take place 
within time intervals of the Order of 10 -8 sec., so that stable atoms of 
this kind are out of the question. But the validity of the electro- 
dynamic laws has been demonstrated experimentally only for macro- 
scopic antennae; it may be that other laws obtain for the individually 
unobservable atomic radiation processes, which are limited only by 
the requirement that the average effect be in harmony with the macro- 
scopic electrodynamic results. Bohr thus made a first postulate to the 
effect that of the continuous sequence of mechanically possible orbits, 
only a discrete set is capable of existence. These orbits are assumed 


‘ N. Bohr, Phil. Mag., 86, p. 1 (1913). 
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to have the property that no radiation takes place when the electron 
is in one of them. The criterion for such an orbit is taken to be the 
condition that the phase integral (p. 130) is an integral multiple of 
Planck’s quantum of action: 

J — (j>pdq = nh (10) 

The second postulate is that the emission or absorption of radiation 
(light) takes place according as the atom passes from a higher to a 
lower energy state or vice versa. At the same time a quantitative re- 
lation found experimentally to hold for all elementary energy trans- 
formations, e.g. the photoelectric effect, must be satisfied: the energy 
difference of the two states must be equal to the quantum hv of the 
emitted or absorbed radiation. No statement is made regarding the 
actual mechanism of the process of radiation or absorption. Also, 
there is apparently no connexion between the frequency of the radia- 
tion and the frequency of revolution -of the electron in its orbit; we 
shall find, however, in § 6 (p. 665) that there is an asymptotic con- 
nexion between the two. 

Let us now apply the two postulates to the hydrogen atom, whose 
electron revolves about the nucleus in a circular orbit of radius a. 
The kinetic energy T amounts to 

T — \rm?o) 2 — $ma 2 <p, (11) 

so that the momentum co-ordinate is 

p* = ma 2 <£ = ma 2 <a (12) 

The phase integral is to be extended over an entire circuit of the 
position co-ordmate — in this case from 0 to 2ir: 

J = £ pdq = J ma 2 a)dtf> = 2irma 2 a>. . . (13) 

We then have the quantum condition 

2wtm 2 ai = nh . (14) 

In addition, there is the equation of mechanical equilibrium 
e 2 

^5 = maa? (15) 


There are thus two equations for the two unknowns a and <a. Solving 
for these quantities, we obtain 


- nh 
~ inbne*' 

_ 87 
n 3 h* 


(16) 


and 


(17) 
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Inserting numerical values for e, h and m, we obtain for n = 1 the 
value 

a 1 = 0-528 X lO- 8 cm., (18) 


i.e. a value which corresponds very well with the gas-kinetic atomic 
radii. The fact that a purely chance agreement between these quan- 
tities is highly improbable lends plausibility to this model of the 
atom. The values of a> correspond to the frequencies of ultra-violet 
light; although the radiated frequencies have no immediate connexion 
with to, it seems significant that they are of the same order of mag- 
nitude. In order to compute the actual radiated frequencies on the 
basis of the second Bohr postulate, we must write ‘down the energy in 
the nth quantum state: 


£„=T+ CI = |a n W- 


(? _ 2'rr z me t 
T n ~ nVt? 


knhnet 


2-7T 2 me 4 

~riW r ’ 


(19) 


The negative sign arises as a result of taking the zero point of the 
potential energy at infinity. The potential energy, which is exactly 
twice the kinetic energy, is negative in the finite region, since work is 
obtained when the distance between the nucleus and the electron 
decreases from infinity. The magnitude of the energy with a positive 
sign signifies the work which must be done in order to remove the 
electron from its orbit to infinity, i.e. to detach it from the atom. 
This work is usually expressed in “ electron volts ”, i.e. the difference 
of potential (in volts) through which an electron must fall in order to 
obtain this amount of kinetic energy. The ionization potential of an 
atom is the name given to the energy difference, expressed in electron 
volts, between the normal state and the state of complete removal. The 
former is the state of lowest energy — the one having the lowest quantum 
number; for hydrogen this is 1. The excitation potential of a given state 
is the difference in energy between this and the “ground” state. 

According to the second Bohr postulate the frequency of the light 
emitted or absorbed in a transition from one state of the atom to 
another is given by 

— i *-«-*£='(£-£). • • m 

where the subscripts i and / denote the initial and final states, re- 
spectively, for the case of emission. In point of fact, all the spectral 
lines emitted by the hydrogen atom do actually satisfy this law. The 
sequence of lines with n f =2, n ( — 3, 4, 5, ... , four lines of which 
are in the visible region, has been known for a long time as Balmer’s 
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series. The lines crowd closer together as the “ current number ” n t 
increases, and at the series limit — given by n { — co , he. vanishing 
variable term — there is an accumulation point of spectral lines, in the 
mathematical sense. Actually the intensity of the lines decreases as 
the series limit is approached. The 



series of lines belonging to n f = 1, 
n< = 2, 3, 4, ... lies in the far 
ultra-violet region, and was not 
found until later by Lyman. The 
lines corresponding to n, — 3, 
n { — 4, 5, ... and to n f — 4, 
n { — 5, 6 , 7, . . . are in the infra- 
red, and are named after their dis- 
coverers the Paschen Series and 
the Brackett Series respectively. 

The origin of the individual 
lines is best visualized with the aid 
of an “ energy level diagram ” as 
in fig./2, in which the various energy 
levels which can be assumed by 
the atom according to the first 
Bohr postulate are represented by 
properly spaced horizontal levels. 
The transitions are represented by 
arrows in the direction of tran- 
sition. The central numbers are 
wave-lengths in Angstroms. 

An outstanding success of the 
theory is to be seen in the fact that 
it not only yields the form of the 
law obeyed by the hydrogen lines, 
but also gives the correct numeri- 
cal value of the constant of the 
formula, when the numerical values 
of e, m and h are inserted. It is 
true that the accuracy to which e, 


m and h are known is far below 


that available in spectroscopic measurements, so that once we have. 


attained confidence in the theory, we shall rather use the constant to 


determine e, m and h. 


It is customary in spectroscopy to use the reciprocal of the wave- 
length, 1/A, rather than the actual frequency. c/A. These so-called 
wave numbers are also designated by v. In order to avoid confusion, 
we write v with a bar over it to denote wave numbers. If the Balmer 
formula is written in these units, 
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the constant R is called the Rydberg Number. By equation (20), this 
constant is given by 

- 2ir 2 ehn /on . 

’ (22) 

and its numerical' value is 

R = 109737 cm 

4. Consideration of the Motion of the Nucleus. The Spectrum of Hydro- 
gen and the Spectrum of Ionized Helium. 

Even if we neglect the relativistic variability of the electron mass, 
the simple formulae derived in § 3 (p. 649) are in need- of refinement, 
since we assumed the nucleus to be at rest. This is true only for in- 
finitely heavy nuclei; in reality, the motion takes place about the 
common centre of gravity of nucleus and electron, and in computing 
the energy we must take into account the kinetic energy of the motion 
of the nucleus also. Denoting the mass of the nucleus by M, its dis- 
tance from the centre of gravity by A, and the distance of the electron 
from this point by a, we have 

MA *= ma (23) 

The condition of mechanical equilibrium for the nucleus is 


md for the electron. 


It is found expedient to expx-ess both a and A in erms of the distance 
(a -f- A i between nucleus and electron. From, equation (j 3) it follows 


a ~fri 


We define a “ reduced ” mass \x by means of the eqnatii 


Then 



[Chap. 


652 ATOMS, MOLECULES AND SPECTRA 


and the condition of equilibrium for the electron becomes 

5T2?“'‘< a - M )“ A • • • • 


The total energy becomes 


(29) 

(30) 


Comparing these formula with the corresponding ones of § 3, we see 
that they become identical if we replace the electron mass m by the 
reduced mass ft. and if we understand a„ in § 3 to mean the distance 
between the nucleus and the electron. Then the Rydberg constant 
for hydrogen becomes 


R„ = ^T 5 = 109677-69 cm "S ^1 = 1 + (31) 


For an element with infinite nuclear mass we have the former value 
of this constant: 


R = R m 


2w i g 4 m 

ch 3 


( 22 ) 


For a one-electron system whose nucleus has the mass of the helium 
nucleus, we have 


Ru* 


2v 2 eV g< , /_L = 1 , 1 \ 

ch 3 ’ \Mh« m M h J' 


. • -(31') 


The spectrum of this system is found in nature. If one of the two 
electrons of the helium atom is removed by violent excitation we have 
such a system; electrically this structure differs from a hydrogen atom 
only in having twice the nuclear charge. This means that the factor 
2 must be supplied to e 3 in the equation of mechanical equilibrium. In 
general, for a nucleus having a oharge Z, the factor Z must be inserted. 
The formula for the energy levels contains e 4 , so that the factor 4 (in 
general, Z 2 ) must be supplied; the energy levels of the singly-ionized 
helium atom (J2e + ) become:* 


_ 4x ‘Itfietpn' _ 4R He ho 

n 3 h 3 n 2 


. (32) 


Thus the wave-lengths of the lines which correspond to the hydrogen 
lines are one-fourth those of the hydrogen lines, so that the He* spec- 
trum corresponding to the Lyman and Balmer series lies in the far 
ultra-violet. For n f — 4, however, the lines fall in the visible- region. 


* The negative sign is omitted before E„, since it is customary in spectroscopy to 
reckon the term values as representing work done in removing an electron. 
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Writing this series in the form 

• • (33) 

we recognize that alternate lines must coincide approximately with 
hydrogen lines. This is the so-called Pickering series, which has been 
known for a long time in stellar spectroscopy. It was formerly attri- 
buted to hydrogen, inasmuch as se close a connexion between the 
spectra of two different elements was looked upon as quite improbable 
before the advent of the Bohr theory. 

The difference between the H and He* wave-lengths is obtained 
by replacing M He approximately by 4 M n : 


R b __ M B (m + M b> ) _ 1+ 4M g - m 

Rh» M Hl (m + M b ) . m_ * M B 

+ M b 


(34) 


This shows that the wave-lengths of the hydrogen lines are 0-04 per 
cent greater than those of the neighbouring He* lines. Reversing the 
procedure, we can use the very accurately measurable difference in 
wave-length to determine e/m spectroscopically, for we have 
e 



m m 


In the exact computation we must introduce — besides the precise 
atomic weight ratios — a series of corrections — e.g. the difference be- 
tween atomic mass and nuclear mass must be taken into consideration. 

Equation (31) is of importance in connexion with the discovery of 
deuterium, the hydrogen isotope of mass 2, whose nucleus is called a 
deuteron. Setting the doubled value in place of M a in this equation 
yields the positions of the deuterium spectrum lines in excellent agree- 
ment with experiment. 


5. X-ray Spectra. Moseley’s Law. 

An atom of atomic number Z which has lost all but one electron 
will emit radiations given by 

' = < 36 > 

if we neglect the motion of the nucleus, which, moreover, is of less im- 
portance as the atomic number increases. On account of the factor Z 2 , 
atomic numbers as low as 30 bring these frequencies into the ordinary 
X-ray region. Highly ionized atoms of this kind cannot be produced 
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experimentally; nevertheless, there are X-ray lines -which, in part, 
conform to the above law. In order to understand their origin we must 
examine in more detail our ideas concerning the structure of the heavier 
atoms. We have every reason to believe that even when such an 
atom is in its nond&l state all electrons are not in the lowest quantum 
state, corresponding to n = 1. Bather, there are groups of electrons 
having the same quantum number n which we may provisionally 
imagine to be arranged in various shells of radii a*, a 2 , . . . a % . These 
shells have come to be denoted by the letters K, L, M, N . . ., corre- 
sponding to the quantum numbers 1, 2, 3, 4. . . . Now the X-ray spectra 
differ experimentally from the visible and ultra-violet spectra in that 
absorption lines do not occur. We observe merely continuous regions 
of absorption having abrupt terminal edges at the long wave-length 
end. This phenomenon can be explained as follows: in the case of 
visible spectra, the outermost electron is brought to a higher orbit 
in the process of absorption. If, however, one of the inner electrons- - 
say a K electron— is to be raised to a higher level, we find that all 
the places in the L, M, . . . shells of a normal atom are occupied, so 
that this electron can find room only at the periphery of the atom. 
But the work necessary to bring it to that point is practically the 
same as that required for complete detachment, assuming the electron 
to possess no kinetic energy after removal. The long wave absorption 
limit, then, corresponds to the work done in detaching the inner elec- 
tron, and the continuous spectrum which follows it corresponds to 
transitions in which the electron possesses varying amounts of kinetic 
energy after leaving the atom. Since the energy of translation of a 
free electron is not of a periodic nature it is not subject to a quantum 
condition of any kind, and so the energy levels form a continuous 
sequence. A similar continuous spectrum is found at the series limit 
of visible spectra, both in absorption Shd in emission. These coatinua 
originate in a similar way. 

If, now, an inner electron is detached from a heavy atom by radia- 
tion or by electron impact, its place can be filled by an electron from 
another shell. This results in radiation. Neglecting all interactions 
between the eleotrons, we would find that the X-ray emission lines 
originating in this way follow the hydrogen formula. In any actual 
case it must be remembered that according to the laws of electro- 
statics the electrons in the inner shells screen the nuclear charge from 
the surroundings, so that we must introduce a screening constant s n 
for each shell in such way that the energy of an n-quantum electron 
is given by 

(37) 

The fact that * n is in large measure independent of Z is a justification 
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of this somewhat crudely constructed theory. We can set s — 1 for 
the K and L shells; then for the line corresponding to a transition 
from the L to the K shell we have 

= . . (38) 

This is the celebrated Moseley Law for the K „ line.* This line 
is found in the X-ray spectra of all elements from Z — 4 to Z = 92. 
The atomic number of any element can be deterinined at. once with 
the help of Moseley’s Law. However, the detailed theory of the X-ray 
terms is very complicated, and by no means yet complete. We have 
given here only the broadest outlines of the theory. 


6. The Correspondence -Principle. 

Equation (17) (p. 648) states that the orbital frequency of the 
electron in a hydrogen atom is given by 


4t rhne? 
h a n a ‘ 


(17) 


According to the classical connexion between radiation and electrical 
oscillations this is the same as the radiated frequency. On the other 
hand, the Quantum Theory gives the expression 


277*6% n,- 2 — - n f % 


for the frequency of the radiation connected with the transition 
» ( -*■ n t . If — n f is small compared with n { we can write instead 


•'qu — 


47T*e% 

~W>nj r 


(w, — »,). 


( 20 ') 


We see that in the limiting case of large quantum numbers the tran- 
sition An = 1 yields a frequency identical with the classical frequency, 
the transition An = 2 gives the first harmonic 2v d , &c. Thus there 
is an asymptotic connexion between the frequencies given by the 
Quantum Theory and the classical frequencies corresponding to the 
mechanical motions of the system. For this reason a quantum theory 
computation of the radiation from an antenna would lead to the same 
result as that already obtained. The extension of this relationship, 
which is called the Principle of Correspondence, makes it possible to 
fill, in part, a gap in the previous theory. Thus, while the classical 
theory of the radiating oscillator permits a computation not only of 

* H. Q. J. Moseley, Phil. Mag., 26, p. 1024 (1913); ibid. 27, p. 703 (1914). 

Since «„ shows a certain progression with Z, it is only the middle elements for 
whioh equation (38) is obeyed exactly. 
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the frequency but also of the intensity, polarization and coherence of 
the emitted radiation from a knowledge of what is occurring in the 
system (cf. p. 335 el seq.), the Bohr theory gives only the frequency. 
In the language of the Bohr theory, the intensity corresponds to the 
frequency of occurrence of a quantum transition. The two fundamental 
postulates have nothing to say concerning this quantity. 

Bohr then assumes — and Ms assumption is justified by its success 
—that an asymptotic connexion obtains for intensity and polarization 
also, in such a way that the frequency of the transitions producing 
light of a given state of polarization is determined by the amplitude 
of the radiation as calculated on classical principles. For small numbers 
it is not determined whether the initial or the final orbit is to be taken 
— it is possible only to make an estimate. The results become unique, 
however, if a given ‘harmonic component is absent from both the 
initial and the final states. In that case the corresponding quantum 
transition certainly does not occur. In this way we arrive at principles 
of selection wMch in many cases limit to a great extent the number 
of possibilities of combination of the various energy levels. 

The Correspondence Principle also points the way to an extension 
of the quantum hypotheses to systems with several degrees of freedom. 
According to the classical theory, the frequencies of a system are given 
by (cf. p. 130) 


dE dE dE 

v '~d7i v *~dJ 2 ’ Vs ~ dT a ' 


. • ( 39 ) 


hence any combination frequency is given by 
dE , 'dE , SE 

Vcl = * 1 dT 1 + s *dT 2 +s *dT 3 --- * * * (40) 


where the s< are any positive or negative integers. Again, for a rela- 
tively small energy change, the energy difference is given as a function 
of the changes of the phasejntegrals. Neglecting higher terms in the 
Taylor expansion, 

AE = 4 -*"-^ 1 aj >+9T t A/ *+lr, A ' / ».'. • <«> 

If the asymptotic relation between the classical and the quantum 
frequency is to be retained, the phase integrals for the case of several 
degrees of freedom must change by integral multiples of h. TMs con- 
dition will certainly obtain if we impose the requirement that not only 
their changes but the quantities themselves are to be integral multiples 
of h. But it follows also from the asymptotic connexion that for, de- 
generate systems there are only as many quantum conditions as the 
syBtem has periods. The Kepler ellipse in space is a doubly-degenerata 
problem, for corresponding to the three co-ordinates of the electron 
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there is but a single frequency — that of the motion in the orbit. This 
means that the energy states of the Kepler ellipse depend upon only 
one of the action variables, viz. J 1 (cf. p. 135). This determines only 
the major axis. Hence all ellipses having the same major axis have the 
same energy. The circular orbit also belongs to this set; therefore the 
formulae of § 3 and § 4 are valid also for elliptic orbits, provided we 
understand a to mean the semi major axis. The introduction of elliptic 
orbits is necessary, since with only circular orbits the Correspondence 
Principle would yield the result that on account of the absence of 
overtones in the initial and final states only transitions for which 
Aw = 1 would be allowed. This means that only the first member of 
each series would appear. For elliptic orbits, on the other hand, all 
higher harmonics are present. 

7. The Atomic Spectra of the Alkalis, the Alkaline Earths and similar 
Systems. 

All calculations of the energy levels of more complicated atoms 
start from the hydrogen terms, and the modifications of the term 
values are approximated to by calculating the perturbations due to 
the other electrons. In order to be able to perform this calculation, 
we require a far-reaching schematization of the atom, whose justifi- 
cation is, moreover, given by the newer Quantum Theory. We consider 
first a system which we have reason to believe, especially from chem- 
istry, consists of a loosely-bound electron — the valence electron — and a 
core consisting of the remaining (Z — 1) electrons, and a nucleus with 
Z charges. The normal atoms of the first column of the periodic table 
— the alkalis and the series Cu, Ag, Au — are represented by such a 
system. Because of the screening effect of the core electrons, the 
valence electron is under the influence of an effective charge of one 
unit. If we neglect entirely the perturbations caused by the core, we 
obtain the hydrogen terms. Now it has been known for a long time 
that the alkali spectra have considerable similarity to the hydrogen 
spectrum. It is to be expected, then, that the deviations can be 
explained as disturbances caused by the -electric field of the core. We 
seek to take this field into account by adding to the Coulomb potential 
of the effective unit nuclear charge a spherically symmetric perturbing 
potential in the form of a power series 

+ (42) 

Since the added field also represents a central force directed toward 
the nucleus, the orbit must remain plane, according to p. 89. Thus 
its orientation in space remains fixed and so the angle variable to 3 
of the Delaunay elements (p. 136) remains constant. However, the 
complete computation of this mechanical problem shows that the 
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perturbing potential causes an advance of the perihelion, i.e. the angle 
variable w t becomes a true variable. As a result, the energy no longer 
depends on J x — J r -j- J t -\- J $ alone, but also on J 2 — J 9 + and 
we have two independent fundamental frequencies: that of the motion 
in the ellipse and that of the advance of perihelion. On the basis of the 
Quantum Theory we are thus required to introduce, in addition to the 
principal quantum number n connected with the description of the 
ellipse, a secondary quantum number k in such way that J t = kh. 
Now it can be shown that JJJ t — k/n gives the ratio of the minor to 
the major axis of the ellipse, so that we must have 1 k ^ n. The 
processing ellipse traces a rosette orbit whose energy is found as follows. 
We compare this energy with that of the ellipse which osculates the 
true orbit at aphelion; this is the path which the electron would pursue 
if the perturbing field were absent. Since the potential of the core, 
and still more the field of the core, decrease with a comparatively 
high power of the distance, the potential energies of the two orbits 
may be taken to be approximately equal at aphelion. But the kinetic 
energies are also the same, for since the centripetal force is the same, 
the centrifugal force mv^jp is also the same in both paths; moreover, 
the radius of curvature p is the same for the two osculating paths, so 
that the kinetic energies are identical. In general, the osculating ellipse 
will not be exactly a quantum orbit. We assign to it a non-integral 
effective quantum number n* by means of the definition of hydrogen- 
like terms: 

E=-^. (48) 


Since, by the Law of Areas, the part of the orbit lying close to the 
core is traversed very rapidly, while the perturbations due to the core 
electrons are appreciable only along this part, we can — to a first 
approximation — set the rotational frequencies on the two paths equal 
to each other. For the ellipse, for which there is but one quantum 
number n*, we have 

(44) 

while for the rosette we must write 


(45) 


since E depends also on k. From the equality of the two frequencies 
it follows that the difference (n — »*) cannot depend upon n itself, 
but only upoufcf „.= „+*(*) (46) 
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We obtain various term sequences according to the value of k. That 
corresponding to k = 1 is called the S sequence; k = 2 yields the P 
sequence; k = 3, the D sequence; k = 4, the F sequence. The origin 
of this notation will be explained below. 

A further improvement is obtained if, instead of setting the periods 
of ellipse and rosette equal, we assume that their difference depends 
only upon the secondary quantum number k. This assumption is 
justified by the fact that the perturbation takes place mainly on the 
inner portion of the orbit, and this is determined primarily by the 
eccentricity, i.e. by the secondary quantum number k. Thus is obtained 
the “ Ritz Term ” 



which represents each term sequence h — const, by means of two 
constants </>(k) and tp(k). This formula yields an excellent represen- 
tation of the observed data. 

The possibilities of transition between the energy levels corre- 
sponding to various values of k are drastically limited by the Corre- 
spondence Principle. It is true that the Fourier analysis of the motion 
of a rotating Kepler ellipse yields all harmonics of the orbital fre- 
quency v x , but on account of the uniform advance of perihelion no 
harmonics of the precessional frequency v 2 result; v a occurs only in 
the combination + v a . In terms of the Correspondence Principle 
this means that a change of the principal quantum number is always 
connected with a change of the secondary quantum nurnber by +1. 
This implies that an S term can combine only with a P term, a P term 
only with an jS or a D term. In this way, we interpret the empirically 
found series for which the following notation is usual: 

•=*(i>,' p=M)> «*■=*( 3), f=m 

0=^(1), w = ^(2), S=^(3), <j> — i/»(4) 

„ 0 _ R- . R 

w + * + ^2] 

nP = - — or, more precisely, 

( n + P? 


[”+p+£j 


(n+s)2 




'[ 
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The series are then written simply: 

Principal Series: v = 1/S — nP, 

Sharp Series: v == 2 P — nS, 

Diffuse Series: v — 2P — nD, 

Fundamental Series: v = 3D — nF, 


n = 2, 3, 4 , . . . 
n= 2,3,4,... 
n = 3, 4, 5, . . . 
n — 4, 5, 6 


This terminology has the, following origin. The Principal Series con- 
tains those lines which the normal atom can absorb, since the fixed 
term with which the P sequence 'combines is the ground term. These 
lines are also the easiest to excite in emission. The names of the Sharp 
and Diffuse series arise from insignificant differences in the sharpness 
of the lines. In the German literature these series are called the “ Second 
Subordinate Series ” and “ First Subordinate Series ” respectively. 
It is to be noticed that they have the same series limit. The Funda- 
mental Series is so called because it was considered of particular sig- 
nificance on account of the fact that its terms differ but little from 
those of hydrogen. German writers refer to this series as the “ Berg- 
mann Series ”, after its discoverer. All possible combinations are not 
covered by these series, however. Apart from the lines fitting into the 
historic series, a large number of “ combination lines ”, e.g. 3D — 3P, 
are known. 

Besides the spectra of neutral atoms of the first column of the 
periodic table, the spectra of singly-ionized alkaline earth metals and 
of doubly-ionized atoms of the third column are also of the alkali type. 
We note that in all these cases only one valence electron is present. 
The factor Z 2 which must be supplied in the term formula is equal to 
4 for the singly-ionized alkaline earths, and 9 for the doubly-ionized 
elements of the third column. But we can distinguish the same series 
also for the elements with two valence electrons, i.e. the neutral atoms 
of the second column. f We must imagine these series to originate when 
one of the two valence electrons occupies the orbits corresponding to 
the several energy levels, the other valence electron being counted as 
part of the core in computing the terms; i.e. for a spherically sym- 
metric perturbing potential we again imagine its charge spread over 
a sphere. The fact that this approximation is better than might he 
expected is explained by the Wave Mechanics. 

Besides the lines fitting into this system, it has been known for a 
long time -that there are lines which originate by combination with a 
new sequence of energy levels empirically called the P' term sequence. 
The remarkable feature of these terms is that they become negative! 
for higher values of the current number and converge to a negative 

* These current numbers are not the actual principal quantum numbers, but are 
the nearest integers to the effective quantum numbers. 

f The difference between systems with one and with two valence electrons is maid- 
tested by the “ fine structure ” of the lines. This is discussed below. 
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limi t. This evidently has the following meaning. If one of the valence 
electrons is raised to higher and higher P' levels and finally freed 
from the atom, the amount of work required exceeds the amount 
needed to remove it by way of the normal energy levels by the magni- 
tude of the energy of the negative limit. But this work can be done 
only on the remainder of the atom, i.e. the limit of the P’ terms corre- 
sponds to an ion with an excited electron. Thus the accented terms 
belong to atomic states in which both valence electrons are raised 
simultaneously. Then the difference oo P — oo P' must correspond to. 
the difference between two terms of the ionized atom. As a matter of 
fact, for calcium, for example, this difference is equal to that of the 
IS and 3 D states of the ion. 

The distribution of the energy of excitation among several elec- 
trons makes the spectra of elements with a number of equivalent 
electrons very complicated. The development of series becomes less 
and less pronounced, since the energy of excitation is employed to lift 
several electrons, instead of raising a single electron to a high orbit. 
However, the abundance of lines in such a spectrum is due in part also 
to a further difference which we shall examine in the following Sections. 

Before we take up these questions we shall attempt to give a more 
complete physical foundation for the potential of the core. If we 
compare the spectroscopically determined values of the functions <f> 
and ip in the Ritz formula with those which would result mathemati- 
cally from given coefficients c 2 , c 3 , c 4 , . . . , we find that for the alkalis, 
from k = 3 onward, the coefficient c 4 of 1/r 4 predominates. This may 
be interpreted as follows. Under the influence of the valence electron, 
the electron shell will be displaced slightly in a direction away from 
the valence electron. As a result the centre of gravity of the positive 
charge will no longer coincide with that of the negative charge, and a 
dipole is formed. If the polarizability of the ion is a, the dipole 
moment will be | m | — ae/r 2 ; but its potential at the place occupied 
by the electron is 



Actually we obtain values of a from the Rydberg and Ritz corrections 
which agree well with refraction data on the ion corresponding to 
the core (cf. p. 452). For small values of k, i.e. for S and P terms, the 
Rydberg correction becomes so large that it can no longer be explained 
by polarization of the core. Schrodinger was able to show that large 
Rydberg corrections of this kind result when the valence electron 
penetrates into the region of the core electrons, which we imagine 
distributed over a spherical shell. Our derivation of the Rydberg 
formula makes no special assumptions as to the nature of the per- 
turbing field, and so is valid for both cases. 
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8. Perturbation of Electron Orbits by External Forces. Stark Effect 
and Zeeman Effect. 

The methods of Celestial Mechanics, whose fundamentals were 
derived in Chapter VI, Part II of this book (p. 108), together with the 
quantum conditions, permit us to calculate the perturbations of the 
term values caused by external electric or magnetic fields. The calcu- 
lation becomes particularly simple when based upon two theorems 
which must be given here without proof. 

The first theorem, the so-called Adiabatic Principle, states that if 
perturbations are produced in a system by slowly altering the external 
conditions, the energy is in general changed, but the phase integrals 
retain their former values. If, then, a state is once characterized as 
a quantum state by values of the phase integrals which are integral 
multiples of A, it remains so when the system undergoes an adiabatic 
change. The meaning of the theorem may be illustrated by means of 
the so-called Einstein pendulum. Let a pendulum bob suspended by 
a thread swing with an amplitude corresponding to a quantum state 
nhv of an oscillator. Let the frequency be raised to v by slowly 
shortening the thread at the point of support. As may be verified 
readily in this case, the pendulum takes up enough energy to give it 
an amplitude corresponding to nhv' . 

The second theorem, which becomes evident immediately, states 
that if there is a perturbing function in the expression for the energy, 
the value of the additional energy due to this cause is, to a first approxi- 
mation, equal to the mean value of the perturbing energy taken over 
the unperturbed orbit. For example, if a hydrogen atom is brought 
into a homogeneous electric field F whose direction is that of the 
z-axis, a perturbation term —eFz is to be added to the Coulomb term 
—eP/r in the expression for the energy, and the average change in 
energy is —eFz. Naturally, this depends on the eccentricity and 
orientation of the orbit, i.e. the orbits having the same major axis — 
whifch have the same energy in the unperturbed atom — assume dif- 
ferent energy values when the field is applied; in other words, the 
energy levels are subdivided. This subdivision is very complicated in 
the case of hydrogen, and it must be looked upon as one of the finest 
accomplishments of the theory that it makes possible the compu- 
tation of these patterns. This phenomenon is the Stark Effect, dis- 
covered independently by Stark and by Lo Surdo. The theory shows 
at once that the subdivision of the levels occurs in this form only for 
one-electron systems. For multiple-electron systems a splitting into 
orbits having various eccentricities but equal major axes is already 
caused by the field of the core. This is merely the decomposition into 
S, P, D term sequences. The external field can exert but a small 
additional influence which manifests itself in a displacement increasing 
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with the square of the field strength and, occasionally, by splitting. 
On the other hand, the field has a great influence on the intensity of 
the lines of these atoms; the field gives rise to harmonics which, accord- 
ing to the Correspondence Principle, make possible transitions which 
cannot occur in the absence of the field. Since very intense local fields 
occur in condensed spark discharges, we see why many “ forbidden ” 
lines appear under such conditions. 

The perturbations caused by a magnetic field are of even greater 
importance in the investigation of the structure of the atom, for a 
field of this kind causes the spectral lines of the atoms to subdivide in 
very characteristic ways. ThiB is the Zeeman Effect. Application of 
the second perturbation theorem leads into difficulty, since the force 
— e[vE]/c exerted by a magnetic field on an electron has no potential. 
One might even believe, at first, that a magnetic field could cause no 
change in energy at aU, since the force is always normal to the path, 
and so can do no work on the electron. This is, of course* true for the 
field itself, but it must be remembered that while the magnetic field 
is building up there exists a time derivative 3H/3«, and hence, by the 
second Maxwell equation, an electric vorticity (cf. Ex. 89, p. 316). 
This curl determines an energy change, but at the same time the orbit 
remains a quantum orbit on account of the Adiabatic Principle. For 
simplicity, we shall assume the field to increase at a uniform rate, 
attaining its full value after N electronic rotations. Then we may 
replace dE/dt in the Maxwell equation by H /N T . The work done by 
the electric field during the time of one revolution of the electron is 

— = — e(j)Eds = — e (curl E)S= ? S, 

where S denotes the area and orientation of the orbit. The total work 
done after N revolutions is then 

TF^|h. ....... (50) 

On account of the Law of Areas we can replace 8/T by the areal 
velocity dS/dt and express this in turn by the mechanical moment of 
momentum d. Then we have 

* • < 51 > 

But according to p. 463, the magnetic moment of an electron orbit 
whose moment of momentum is d is given by 





so that 


(52) 

(53) 
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The utork done in increasing the field from zero to its final value H is 
equal to the energy of the magnetic dipole of strength f> in the field H. 

In what form is this energy present in the atom? Since there is 
no such quantity as potential energy of an electron in a magnetic field, 
this energy can be present only as kinetic energy. To a first approxi- 
mation the change of motion can be given very simply. We prove 
a theorem, given by Larmor, to the effect that to a first approxima- 
tion the only effect of the magnetic field is to cause a uniform rotation 
of the entire electron system about the field as an axis, the direction 
and magnitude of the rotation being given by 


(54) 


If the co-ordinate system to which the electron motions are referred 
is in rotation with respect to an inertial system, the following inertial 
forces arise (cf. p. 233): the Coriolis force 

F 0 = — 2m [iot> re [], (55) 

and the centrifugal force 

F r = — m[cn [tor]]. 

If the product [cor] is small compared with the velocity in the moving 
system — which is, of course, the case here — then the centrifugal force 
can be neglected in comparison with the Coriolis force. The force 
exerted by the magnetic field, on the other hand, is 

F m = — - [irH], (56) 


where, strictly, V should be replaced by the absolute velocity 
v Tel -(- [cor], but may .be replaced approximately by t> rel . The two 
forces neutralize each other, i.e. determine the same electron orbits in 
the moving system as those which would be present without the field 
in the system at rest, if 

= — ■ [s’rei H] 5- )- 2m [t>«i co] = 0, or co=~H. (57) 

This is known as the Larmor precession. The corresponding change 
in kinetic energy — which may be positive or negative according to 
the relative positions of co and the orbital angular momentum— is the 
work done by the field. In what follows we shall use the convenient 
formula (53) to determine the change of the term values. 

As long as any arbitrary orientation of f> — or, what amounts to 
the same thing, of the angular momentum vector d — with respect to 
the field H is permissible, we obtain a continuous manifold of energy 
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changes. This means that a quantized energy value would become 
diffuse in the field, so that no splitting of lines but only a broadening 
could take pltwe in a magnetic field. However, it must be remembered 
that the Larmor precession of a Kepler ellipse determines a uniform 
rotation of the line of nodes, and the third Delaunay variable w 3 
becomes a true variable (cf. p. 136). Then, according to § 6 (p. 656), 
the associated action variable J 3 must become an integral multiple 
of h. This quantity represents 277 times the z-eomponent of the moment 
of momentum. Since 2w times the total angular momentum is itself 
an action variable J 2 , which is an integral multiple of h, it follows 
that if we set J 3 = mh the inclination of the orbit is given by 



We thus arrive at a directional quantization. For k — 1 only the 
values m — +1, 0, —1, i.e. the angles 0, tt/2, it are possible. It is to 
be noticed that the quantum numbers n and k are essentially positive 
quantities, but the “ magnetic ” quantum number in, on the other 
hand, can assume both positive and negative values since an external 
field determines a definite sense as well as direction. 

This directional (or spatial) quantization, which is of especial 
importance in the Theory of Paramagnetism, was directly verified by 
the experiments of Stern and Gerlach.* In these experiments a fine 
pencil of atoms of silver or some other metal was produced by 
evaporating the metal in a high vacuum. The beam was sent through 
a non-homogeneous magnetic field whose gradient grad j H | had 
practically the same direction as H. Such a field exists, for example, 
between two pole pieces, one of which is a fiat plate, the other a sharp 
edge.f A magnetic dipole experiences a translational force in such a 
field (cf. Ex. 78, p. 286); this force is in the direction of H and is 
given by 

|F| = |^| | grad HI cos (^H). . . . (59) 

If, as would be expected in the classical statistics, all orientations 
of occurred as a result of the thermal motion, the beam would 
experience a continuous broadening in the magnetic field. In reality, 
however, the beam of silver atoms was found to break up into two 
parts, corresponding to the orientations 0 and rt. There is a particular 
reason for the absence of the orientation 7r/2 (cf. p. 670). 

The Quantum Theory provides a fundamental unit for the magni- 
tude of the magnetic moment, the Bohr Magneton. Since the total 
angular momentum is an action variable and so must be an integral 

* 0. Stem and W. Gerlach, Ann. der Physik, 74, p. 673 (1924). 

t This type of field is not '■trictly of zero divergence. In carrying out the experi- 
ment it is even more closely approximated by means of a groove of rectangular cross- 
section that is out in the plate. 
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multiple of A/2w (this holds for all atoms, not only for hydrogen), it 
follows from the magneto-mechanical parallelism (p. 463) that the 
magnetic moment must also be an integral multiple of the quantity 

'‘*=iWo- M7xl0_ " em ' 1 - • • • (60) 

The atomic ray experiments actually gave a deviation for silver which 
yielded | ft I = I'M*, but one cannot expect the susceptibility measure- 
ments evaluated according to p. 458 to give integral numbers of mag- 
netons, since the directional quantization is not assumed in this evalua- 
tion (cf. p. 672). 

With the aid of equation (53) (p. 663) and equation (60), we can 
write at once the energy changes experienced in a magnetic field by 
an atom with secondary quantum number k, i.e. with moment of 
momentum kh/2ir: 

* AE — — -£H = + f i Jell cos i — ± [i B Hm 

= + m,0^\m\<.k. , (61) 

“ 4 irm A c 

In order to find how the spectral lines are split up, we must derive 
selection principles for the spatial quantum number m.- We again 
turn to the Correspondence Principle and use it to determine the 
nature of the light observed in various directions when the radiation 
takes place according to the classical theory. We can determine the 
time rates of change of the components of the dipole moment from 
the motions of the electrons. By p. 336 et seq. these quantities 
determine the radiation of the “ antenna system ”. At the same time, 
we must bear in mind the fact that a Hertzian oscillator emits no 
radiation in the direction of its axis. Hence, if we observe in the direc- 
tion of the magnetic field (“ longitudinal effect ”) we do not perceive 
the z-component of the motion of the electrons, which is not affected 
by the field. The x- and ^-components, which execute the Larmor 
precession, are observed completely. The Fourier analysis yields 
combination frequencies of the form gv t + v 3 + v 3 . In accordance 
with the Correspondence Principle, this means that only the quantum 
transition Am — +1 occurs for radiation in the direction of the field, 
and that since the classical radiation is circularly polarized in the 
initial and final orbits, the emitted light is circularly polarized. In 
observing perpendicular to the field (“ transverse effect ”), the unin- 
fluenced component is observed completely. Its Fourier analysis does 
not contain v 3 , but only frequencies of the form s iq + i» 2 , so that we 
have quantum transitions Am — 0 connected with radiation normal 
to the field. The light vibrations are then parallel to the field. Of 
the transverse components of motion, i.e. the x- and y-components, 
the one in the line of sight will not be observed) the light sent out by 
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it is linearly polarized in g direction normal to the field. Since the 
combinations sv, + v 2 + n s again occur in the Fourier analysis, 
the transitions Am = +1 correspond to them. Then, on account 
of the fact that — m f is limited to +1 or 0, the lines are divided in 
the following way: 


= XA eh ?~. (m, - m f ) = 0 or Av = ± (62) 

bArm^c' iirmac 


In the longitudinal effect the value 0 is excluded and the light of both 
components is either right-handed or left-handed circularly polarized,* 
while in the transverse effect the undisplaced line is polarized in a 
direction parallel to the field, the side components in a direction normal 
to the field. This pattern of lines is called the normal Lorentz Triplet. 
The theory of this pattern was derived by Lorentz using the simple 
linear oscillator as a model of the radiating atom.f The Quantum 
Theory goes no farther; this is strange, since the normal Zeeman 
efEect is shown by relatively few lines, and most lines exhibit much 
more complicated patterns containing large numbers of components — 
the so-called anomalous Zeeman effect. Nevertheless, regardless of 
how complicated the atom may be, the total angular momentum being 
constant in time is an action variable, and hence subject to the quan- 
tum condition; thus our formula is generally valid as long as the 
normal connexion between moment of momentum and magnetic 
moment obtains. 


9. Difficulties arising in the Atomic Theory Of Magnetism. Explanation 
in Terms of the Spinning Electron. 

The anomalous Zeeman effect is not the only difficulty ofEered by 
magnetism in the mechanics of the atom. The fine structure of many 
lines, e.g. the division of the familiar yellow line of sodium IN — 2 P 
iato two components A5890 and A5896 A.U. even in the absence of a 
field, is still unexplained. The components of this multiplet are really 
connected, and are not formed merely by lines which happen to lie 
near each other: if these lines are examined in a magnetic field of 
increasing strength, the pattern changes when the separation of the 
Zeeman components becomes comparable with that of the original 
fines and passes over into a single normal Lorentz triplet for very strong 
fields. This is known as the Paschen-BacTc Effect. One would conclude, 
then, that the doublet is caused by an internal magnetic field, and 
the location of this field would be assumed to be the core. But this 
cannot be, for the alkali ions corresponding to the core are known to 
be diamagnetic, and so are incapable of generating a field. 

* It is convenient to remember that the short-wave component v + the rotates in 
the same sense as the ourrenfc which would cause the external field. 

t S. 4- Lorentz, The Theory of Electrons, London, Macmillan & Co. (1909). 
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Atoms with several valence electrons exhibit multiplets of much 
greater complexity. According to our previous computations these 
components should all come together to form a single line. For sim- 
plicity we consider not the splitting of the lines, which are always 
given by the difference of two terms, but rather that of the terms 
themselves. The 8 term is always single. The multiplicity increases 
with X; up to a maximum value and remains constant thereafter; the 
multiplicity does not depend on the principal quantum number, and 
so is a characteristic quantity for each term sequence. The term 
system is designated as singlet, doublet, triplet, . . . according to the 
highest multiplicity. This multiplicity is written as a superscript to 
the left of the term symbol; thus a S signifies the “ triplet S state ” of 
the atom. An atom with several valence electrons has several term 
systems with various multiplicities. For example, the alkaline earths 
possess a singlet system and a triplet system. As we proceed from 
column to column of the periodic table the even and odd multiplicities 
alternate. The multiplicities of a singly-ionized atom correspond to 
those of the preceding neutral atom. The highest multiplicity in the 
term system of an atom is greater by one than the number of valence 
electrons, i.e. one larger than the number of the column of the periodic 
table in which the atom belongs. Thus, for an element of column VII 
— e.g. manganese — octuple terms can occur. Were it not for selection 
rules, 64 lines could result from combinations between two such octet 
levels. Despite the fact that this number is reduced by the existence 
of selection rules, the tremendous richness of the spectra of atoms 
with many equivalently bound electrons is easy to understand. An 
analysis, however, yields a representation of the spectrum as com- 
binations of a moderate number of multiplet terms. The lines re- 
sulting from the division of the .terms may be represented formally 
by giving each sub-level a new quantum number j, introduced by 
Sommerfeld as the “ inner ” quantum number. The selection rule for 
3 i® 

Aj — 0 or +1, with the transition 0 -> 0 forbidden. 

Coxlbeming the physical significance of j, Sommerfeld surmised at the 
very first that jhj2n represents the total angular momentum of the 
atom; this may now be considered a proven faot. For an alkali atom, 
whose core is not magnetic and which therefore has no angular momen- 
tum, j would then appear to be identical with h, the angular momentum 
of the orbital motion of the valence electron. But this is evidently 
not the case, for there are two different values of j corresponding 
to the same k in the P term; hence there must be something 
else in the atom which is magnetic and which possesses angular 
momentum. 

A further remarkable fact is the following: The spectra of the one- 
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electron systems H and He+ also exhibit “ fine structure ” which, how- 
ever, is quantitatively explained by a theory taking into account the 
relativistic variability of mass of the electron. Even in the absence 
of perturbations by the core, the orbits of various eccentricities have 
different energies on this account. For the alkalis, the splitting due to 
the different eccentricities is greatly increased by the core potential 
and the orbits of different eccentricity are separated into S, P, D terms, 
so that the relativistic effect plays a comparatively insignificant part. 
Nevertheless, the alkali doublets follow the same law as the relativistic 
hydrogen doublets as far as the dependence on nuclear charge is con- 
cerned. This might lead one to abandon the magnetic interpretation 
of the alkali doublets, yet there is still the possibility that both types 
of division follow the same law as a matter of chance. 

Finally, we refer in this connexion to the magneto-mechanical 
anomaly found in the Einstein-de Haas Effect (p. 463), in which values 
of the magnetism of -ferromagnetic substances result which are twice 
as great as those to be expected according to the magneto-mechanical 
parallelism. 

All of these difficulties were removed at one stroke by Goudsmit 
and Uhlenbeck by their bold assumption that the electron itself possesses 
a magnetic moment and a mechanical angular momentum.* The mag- 
netic moment is assumed to be one Bohr magneton, while the moment 
of momentum is taken to be only l/2(A/27r). These assumptions are 
justified by their results. The magnetic moment may be pictured as 
originating from rotation of the electronic charge about an axis, and 
it is not difficult to find the distribution of charge which gives the 
correct ratio of magnetic moment to angular momentum. However, 
no deeper meaning is to be attached to such a detailed picture of the 
structure of the electron, since it cannot be verified in any way. Never- 
theless, this concept led to the term spinning electron. We refer to the 
inherent angular momentum as the spin of the electron. It is now 
understood that the magneto-mechanical anomaly for ferromagnetic 
substances — which are, without exception, metals — is due to the con- 
duction electrons, while for salts — where the magnetism is the resultant 
of the orbital and spin moments — the factor lies between 1 and 2. 

10. The Theory of Multiplets and of their .Zeeman Effects. Quantum 
Theory of Paramagnetic Susceptibility. 

With the aid of the concept of electron spin one can readily bring 
order out of the chaos of multiplets. We limit our considerations to 
so-called normal multiplets for which the interaction of the electrons 
— called LS or Eussell-Saunders coupling — may be represented by the 
following vector construction: 

1. All spin vectors s. ; may be combined algebraically into a resultant 
• S. Goudsmit and G. Uhlenbeck, Zeitsch f. Pkya., 35, 8-9, p. 618 (1926). 



670 ATOMS, MOLECULES AND SPECTRA [Chap. 

spin 3 which is an integer or a half-integer (measured in units of h/2n): 
8 = Zs, = 2 + f 

ThuB for two electrons the resultant spin is either 0 or 1. 

2. All orbital angular momenta are to be combined vectorial ly to 
yield an integral resultant. Here we must introduce a modification 
which shows, for the first time, the insufficiency of our mechanical 
models: In order to keep the results in harmony with experience we 
find it necessary to decrease the orbital angular momenta by one 
unit. If these were the actual momenta, it would mean that the 8 
orbits possess zero angular momentum, and would thus be straight 
lines through the nucleus. Only the newer quantum mechanics, which 
abstains from all special models, is capable of leading compulsorily to 
this numeration. One less than the secondary quantum number for 
each electron is called l t . Thus l ( — k t — 1. 
The integral resultant for several electrons is 
called the group quantum number l. The 
symbols S, P, D will be applied to the group 
quantum numbers, while the corresponding 
small letters will be used for the individual 
electrons. 

3. The resultants s and / are to be com- 
bined vectoriaUy into a resultant j which is 
integral (including zero) or half-integral, ac- 
cording as s is one or the other. The various 
integral or half-integral resultants are f ound by 
means of the following construction. Assume 
l > s. Describe a circle of jradius s about the 
terminus of the vector I(fig. 3). The terminus 
of the resultant must lie on this circle. Its in- 
tegral or half-integral values are constructed by drawing circles about 
the initial point of l, using the integers and half-integers as radii. The 
intersections of these circles with that of radius s mark the possible 
end points of the resultant vector,/. It is seen that for l > a the number 
of possibilities, for integral or half-integral spin resultant, is 2s 4- 1. 
This is then the multiplicity r of a term system corresponding to the 
spin resultant s. Since the latter is at most Nj2 when there are N 
valence electrons, the ma x imu m multiplicity is 2V,+ 1. The other 
empirical rules given on p. 668 are obtained with equal ease from this 
construction. We attach the quantum number j to the term symbol 
as a subscript, e.g. 2 a 2\/r 

4. In a weak external magnetic field (additional magnetic energy 
small compared with the natural multiplet separation) the vector,/ 
sets itself with respect to- the field in such a way that its projection on 
the field direction is a positive or negative integer or half-integer 


7*5 



Fig. 3 
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according as j itself is one or the other. This gives (2 j + 1) positions, 
since the projection 0 is allowable only when j is integral. 

6. In a strong magnetic field, on the other hand, the coupling 
between the orbital and spin vectora is dissolved; the resultant orbital 
angular momentum orients itself in such manner that its projection in 
the direction of the field is a whole number and, similarly, the resultant 
spin sets itself so that its projection on the field is a half-integer. 

These five rules not only embody a correct representation of multi- 
plets, but also lead to a formula for their Zeeman effect which is almost 
exactly correct. 

If we now compute the additional energy due to the magnetic 
field by means of the expression — ^>H, we are not permitted to assume 
the normal connexion between the total angular momentum jh/2v 
and the magnetic moment, for j is a combina- 
tion of the normal orbital angular momentum h 
with the anomalous spin. The true relation- 
ship is obtained from the following considera- 
tions: the invariable axis of the atom is given 
by the direction of the total angular momentum 
vector j. The vector of the resultant orbital 
angular momentum l precesses about this line. 

Since j is by construction the resultant of l 
and s, this means that the entire triangle OLS 
turns about its longest side. On the average, 
then, only the components of the vectors / and 
s in the direction oij contribute to the magnetic 
moment. On account of the anomaly of the Fi «-* 

spin, the component of s must be included 
twice, so that the magnetic moment becomes (cf. fig. 4) 



i*i“55b w+ * ,, “ wa • • 

(63) 

and since 

cos (sj)- sZ + ^- 1 \ . . . . 

• (64) 

we have 

2s; ’ 

• (66) 


The factor in brackets, g, which measures the ratio of the actual mag- 
netic moment to the one to be expected in the normal case, is called 
the gyrornagnetic ratio. The above value, however, does not yet corres- 
pond with experiment. The strange thing is that the correct formula 
is obtained only after replacing each x 2 by x(x -j- 1), i.e. 



. , , Jti + 1) + 4s + l) -1(1 + 1) 

9 ~ 1+ m+T) • 


. • (66) 
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This again demonstrates that the relations derived from models re- 
produce the facts approximately, but not absolutely correctly. It is 
only the new Quantum Theory which is capable of justifying formula 
( 66 ). 

The fact that g has different values jn the initial and final states 
gives rise to the large number of components in the multiplet pattern. 
It follows from the energy increments 

= y. M g i m i H and A E f — g, s g/m f H . . (67) 

that the it components of the transverse effect (polarized with the 
electric vector parallel to H), for which m t — m f — +m, are given by 
hAv w = + fi s Hm(g t — g f ), .... (68) 

and the a components of the transverse effect (polarized with the 
electric vector perpendicular to H), for which m i = m +1, m t — +m, 
are given by " 

±pJR{m{g l — g f ) ±^]. . . (69) 

As we see, the normal pattern is obtained for g t = g f — 1. 

If, however, the magnetic field is very strong, the resultant orbital 
and spin angular momenta orient themselves independently, and we 
have the energy increment given by 

AE — /***»,# -f 2/x J |Wv f JJ, .... (70) 

where t»j and m, are the projections of / and « on the direction of H; 
and since m, can have only the values +1/2 or — 1/2, the energy 
increment is that found in the normal effect. 

On the basis of these considerations the calculation of magnetic 
susceptibility performed on p. 458 must be revised also. There we 
found that the magnetization of unit volume was given, according to 
the classical statistics, by 

Referring all quantities to one mol, the susceptibility then becomes 
*=<M s 5?3. (71) 

Since cos 2 Q — 1/3 in the classical statistics, we have a unique con- 
nexion between the susceptibility and the atomic moment p. Accord- 
ing to quantum principles, 

7> (72) 

and since only 2 j + 1 orientations — viz. those for which the magnetic 



quantum number m if 
cos 2 # becomes 
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whole number; — are allowable, the mean value 


_ 2j(j+l)(2j+ 1) _ ,j ± 1 

(2j+ 1)^.6 j ' X > 


This r formula is valid for integral as well as half-integral values of j, 
as one may verify readily. Then we have 


+ i)f*/ 

x WT 


(74) 


On account of (72), the Curie constant, which was L 2 p 2 jZR in the 
classical case, is now 


„ Lyj(j+i)fi*_L*p*(j + i) 
^ ~ 3 R 3 R j ‘ 


. . (75) 


In order to calculate the elementary magnetic moment p from the 
observed value of C it would be necessary to know the value of j cor- 
responding to th§ atomic state in question. One must therefore pro- 
ceed in the opposite way by calculating g and then C from the values 
of the quantum numbers l, s and j determined in some other way — 
spectroscopically or purely theoretically — and then compare the 
results with observation. A test of (75) which is free from objection 
must be undertaken by means of susceptibility measurements on para- 
magnetic, monatomic gases, since it is only here that disturbing in- 
fluences are excluded. Unfortunately, the experimental obstacles have 
not yet been overcome, so that One must turn to experiments on 
solutions containing paramagnetic ions. Among the ions of the rare 
earths we find very good agreement between theory and experiment; 
this is understandable because, unlike the ions of the iron group, the 
electron shell responsible for ‘paramagnetism is not at the surface and 
so is not subject to disturbances from neighbouring molecules. These 
disturbances prevent application of (75) in the case of the iron group, 
but at the same time they make conditions simpler. To a first 
approximation take all the terms of the ions of the irdn group to be 
S .terms. In this instance equation (75) becomes 


0 


4 ZflSQS + 1) , 
3 R ’ 


(75') 


i.e. the effective number ofi Bohr magnetons becomes 

p.„ = VmTT), i . (75") 

The LS coupling treated here is not always developed to the exclusion of 
everything else. Among the heavier elements, transition types, representing the 
analogue of an incipient Paschen-Back effect, appear. Here we encounter for the 
23 < E m> 
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first time the general principle that the orientation of the spin vectors conforms 
to the magnitude of the energy of interaction. In a weak field the coupling 
between / and s overshadows that between the magnets and the field, so that 
l and s are first oombined to produce j, whioh then orients itself in the field. Rut 
in a strong field both the coupling between l and the field, and that between s 
and the field, exceed that between l and s. By the same principle the LS coupling 
!b efieotive only when the energy of coupling between the orbit and spin of an 
electron is small compared with the magnitudes of these energies among the several 
electrons. The spin-orbit coupling energy increases rapidly with the efieotive 
nuclear charge, so that the conditions no longer apply to heavy atoms, particu- 
larly those with many valence electrons. The other extreme case, called jj coup- 
ling, is encountered, for example, in the spectra of multiply ionized lead. In this 
case the spin and orbital momentum of an individual electron combine into a 
vector j\ — l t d; i- The total angular momentum is the integral or half-integral 
vector sum of the j\, but there is no resultant / or s, so that the nomenclature 
based on LS coupling cannot be applied at all. If this is occasionally done in 
connexion with, say, Pb terms, it is by analogy with the terms of homologous 
systems having smaller nuclear charge. X-ray terms, too, are of the jj type. Of 
course the g formula becomes quit© different for jj coupling. A different g value 
must be substituted in the first part of (76), but the second part remains un- 
changed because it contains only the magnetic moment and the quantum number 
J , whioh is valid in both oases. 

Ex. 132. Sketoh the relative positions of the term3 arising in the case of two 
electrons with Z, = 1 and Z 2 = 0 for (a) LS coupling, (6) jj ooupling. 

Ex. 133. The Faraday Effect is the rotation of the plane of polarization of 
a beam of light when passed through a dense transparent rfiedium situated in 
a magnetic field, the ray being directed along the field. Develop an approxi- 
mate theory of this effeot by combining the- results obtained for the Zeeman 
Effect with the dispersion formula of p. 463. 

11. Tbe Structure of the Teriodie System of the Elements. Pauli’s 
Principle. 

After arriving at the concept that the electrons of the more com- 
plicated atoms are arranged in shells having various quantum numbers, 
it was natural to assume that a new shell is begun with each period, i.e. 
each time we come to an alkali metal. The principal quantum number 
of the valence electron would therefore have the values Id 2, Fa 3, 
K 4, Rb 5, Cs 6. Hence the lowest term of Na, which corresponds to 
the normal state, must be given the principal quantum number 3. 
But there is evidence against this, for the effective quantum number 
«* obtained from the spectroscopic term value lies in the neighbour- 
hood of 1-5. There must therefore be a very large Rydberg correction. 
That this is really the case is shown by the following considerations: 
Let us examine the similar systems Mg+, A1++, Si +3 , P+ 4 , S +s * in 
which there is likewise one valence electron and a core consisting of 
ten electrons. It is to be expected that as the effective nuclear charge 
— which amounts to 2 for Mg+, 3 for A1++, &c. — increases, the per- 

* Fowler introduood another notation for the speotra of atoms in- various stages 
of ionization: the “ are ”, spark ’’ and “ super-spark ” speotra of an element E are 
referred to as E lt E v E-. &c., in this order. 
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turbations caused by the core will decrease. If we plot the values of 
the IS, 2 P and 3 D terms as functions of the atomic number, at the 
same time dividing each term by Z® in order to bring them to the same 
scale, we see from fig. 5 that both the 8 and the P terms converge 
toward the value of the three-quantum hydrogen level. Thus for the 
sodium atom the perturbation (caused by penetration of the orbit into the 
core) is really as great as indicated, and the above assignment of principal 
quantum numbers for the valence electron of the alkalis is justified. 

We follow Bohr in assuming an atom to be built up by adding 
electrons successively to a bare nucleus. It is then natural to assume 
that the quantum numbers of 
the orbits of the first, second, 
third, &c., electrons are in- 
dependent of Z, so that a 
system containing N electrons 
would be a model of the atom . 
of atomic number N* This 
rule, suggestive of the Funda - 
mental Law of Biogenetics, is in 
general obeyed, but is violated 
at certain places in the periodic 
table, i.e. for certain values of 
Z, for which a reconstruction 
takes place. That is, if we con- 
sider the spectroscopically de- 
termined quantum numbers 
corresponding to the lowest 
terms — the normal states — of 
the atom, we find that the 

electrons in the first row are in a orbits. There is indeed no other 
possibility when n — 1 , since k^n or, otherwise expressed, 1 5* n — 1 . 
In the second row we find, besides s orbits, p orbits corresponding 
to the principal quantum number 2. The ground states of the 
atoms of the third row — the elements of which correspond both 
chemically^ Snd spectroscopically to those immediately above them — 
also have only s and p Crbits, although, a priori, d orbits are also to be 
expected for n — 3., In Bow 4 there are certainly 4a orbits for K and 
Ca. Now the following elements. Sc to Cu, exhibit spectroscopic and 
chemical behaviour quite different from that of the elements of the 
preceding row.. Spectroscopically they arc distinguished by high 
multiplicities of the teriris. while in the previous rows the high multi- 
plicities possible according to the number of valence electrons (i.e. the 
column number) are not observed. The elements Sc to Cu are charac- 

i six* because of the greater unclear 



Na Mf AL** Si 3 * P** S s+ 




slii? 
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terized by the occurrence of many valences and by the formation of 
complex ions which are highly coloured and are paramagnetic, while 
the ions of all previous elements absorb light only in the far ultra- 
violet -and are diamagnetic. For this reason it has long been held that 
the 3 d orbits, which do not appear up to this point, are inserted here. 
Bohr drew this conclusion from an extrapolation which has since been 
confirmed by analysis of the Sc++ spectrum: if the values of the lowest 
S, P and D terms for K and Ca + are plotted as functions of the atomic 
number, the extrapolation to Sc++ shows that the largest term corre- 
sponding to the lowest energy state is a D term. Thus from Z — 21 
onward, the nineteenth electron is no longer placed in a 4s orbit, but 
in a 3d orbit. The unusual properties of the elements in the sequence 
Sc to Ni are attributable to the fact that all their electrons over 18 
belong to an unclosed group. It is only when we get to Cu that we 
again find the essential spectroscopic features of an atom having one 
valence electron, although the occurrence of additional lines and the 
chemical bivalence indicate that the underlying shell is still vulner- 
able to outside disturbances. Since Cu has 29 electrons and has one 
4s electron, as has K (Z — 19), a total of 10 3 d electrons have been 
added in the series So to Ni. This subsequent filling out of a pro- 
visionally completed shell repeats in the following periods. The group 
of four-quantum orbits is completed with Kr, but in analogy with the 
other rare gases, this element has only 4s and 4 p orbits. Nevertheless, 
the construction of the five-quantum electron group already begins 
at Rb; the filling in of this group is interrupted at Y, i.e. at the place 
corresponding to Sc. Again it is d orbits which are supplied to the 
four-quantum group in the series Y to Pd. It must be remembered, 
however, that the 4/ orbits, which are "also possible, have not yet 
appeared. Ag, which is definitely univalent, possesses a 5s electron, 
as does Rb. The 5s and 5 p orbits are again closed at the element X, 
and the valence electron of Cs travels in a 6s orbit. The filling in of 
the 5 d group is resumed with La, the third element of the series. At 
this point there begins a series of elements which are chemically very 
similar and which have no counterparts in the elements above them. 
These are the rare earths. There is every reason to assume that the 
insertion of the 4 f group — which is not yet present— begins here. The 
number of 4 f electrons to be supplied can be guessed by extrapolation : 
According to what we have seen, the relation between the quantum 
number and the number of electrons in a completed shell is given by 


Principal quantum number n . . 

1 

2 

3 

Number of electrons . . . . 

2 = 21* i 

8 = 2-2* 

18 = 2-3* 


It is to be expected, then, that there will be in all 2-4 a = 32 four- 




ELECTRON CONFIGURATIONS OF THE ELEMENTS 


U_ 


~2t 2 p 


3 Li 1 

4 Be 2 

5 B 2 1 

8 C 2 2 

7 N 2 3 

8 0 2 4 

9 F 2 6 

10 No 2 6 


3s ‘dp 3d 


11 Na 1 

12 Mg 2 

ISA) 2 1 

14 Si 2 2 

13 P 2 3 

16 S 2 4 

17 Cl 2 5 

18 _A 2 6 

18 K" 2~ 6 

20 Ca 2 6 

21 So 2 6 1 

22 Ti 2 6 2 

23 V 2 6 3 

24 Cr 2 6 6 

25 Mn 2 6 6 

26 Fe 2 8 6 

27 Co 2 6 7 

28 Ni 2 6 8 

29 Cu 2 6 10 


4 s 4 p id 


30 Zn 2 

31 Ga 2 1 

32 Ge 2 2 

33 As 2 3 

34 Se 2 4 

36 Br 2 6 

36 Kr 2 6 


37 Rb 2 6 

38 Sr 2 6 

39 Y 2 8 1 

40 Zr 2 6 2 

al Nfc 2 6 4 

42 Mo 2 8 6 

43 Tc 2 6 (6) 

44 Ru 2 6 7 

45 Rh 2 6 8 

46 Pd 2 6 10 

47 Ag 2 6 10 

48 Cd 2 8 10 



4s 

ip 

id 

4/ 

6s 

6p 

6 d 

65 

49 In 

2 

6 

10 


2 

1 



60 Sn 

2 

6 

10 


2 

2 



61 Sb 

2 

6 

10 


2 

3 



52 Te 

2 

6 

10 


2 

4 



63 I 

2 

6 

10 


2 

5 



54 Xe 

2 

6 

10 


2 

6 



66 Cs 

2 

6 

10 


2 

6 


i 

66 Ba 

2 

8 

10 


2 

6 


2 

57 La 

2 

6 

10 


2 

6 


2 

58 Ce 

2 

6 

10 

2 

2 

6 


2 

59 Pr 

2 

6 

10 


2 

6 

(1) 

(2) 

60 Nd 

2 

6 

10 

4 

2 

6 

0 

2 

61 Pm 

2 

6 

10 

6 

2 

8 

(0) 

(2) 

62 Sm 

2 

8 

10 

8 

2 

6 

0 


63 Eu 

2 

6 

10 

7 

2 

6 

0 

2 

64 Gd 

2 

6 

10 

7 

2 

6 



65 Tb 

2 

.6 

10 

9 

2 

6 

1 

2 

66 Dy 

2 

8 

10 

10 

2 

6 

(D 

(2) 

67 He 

2 

6 

10 

11 

2 

6 

(i) 

(2) 

68 Er 

2 

6 

10 

12 

2 

6 

l 

2 

69 Tm 


6 

10 

13 


6 

0 


70 Yb 

2 

6 

10 

14 


6 

0 

2 

71 Lu 

2 

6 

10 

14 

2 

6 

] 

2 


6s 

6 P 

5 d 

6s 

Qp 

6tf 

7s 

«/ 
























Tc 

Technetium 

74 W 

2 

6 


2' 


Pm Prometheum | 

75 Re 

2 

6 


2 


At 

Astatine 

76 Oe 

2 

6 

6 

2 


Fr 

Francium 

77 Ir 

2 

6 

7" 

2 


Np Neptunium 

78 Pt 

2 

6 


1 


Pu 

Plutonium ! 


2 

6 

10 

1 


Am Americium 

80 Hg 

81 Tl 

2 

2 

6 

6 

10 

10 

2 

2 

j 

Cm Curium 

Bk Berkelium j 

82 Pb 

2 

6 

10 

2 

2 

Cf 

Californium 


2 

6 

10 

2 

3 

E 

Einsteinium 

84 Po 

2 

8 


2 

4 

Fm Fermmm 

86 At 

2 

6 

10 

2 

6 

Mv 

Mendelevium 1 

86 Rn 

2 

8 

10 

2 

8 




87 Fr 

*7" 

6 

10 

2 

8 


i 

6 / 

88 Ra 

2 

3 

1ft 

2 

6 


2 


89 Ac 


8 

10 

2 


1 

2 


90 Tb 

2 


10 

2 

6, 

2 

2 


91 Pa 

2 


10 

2 

-6 

1 

(2) 

(2) 

92 U 

2 

6 

10 

2 

6 

1 

2 

(3) 

93 Np 

2 

6 

10 

2 

6 

(1) 

(2) 

(4) 

94 Pu 

2 

6 

10 

2 

6 

(I) 

(2) 

(5) 

96 Am 

2 

3 

10 

2 

6 

(1) 

(2) 

(6) 

96 Cm 

2 

6 

10 

2 

8 

(1) 

(2) 

(7) 

97 Bk 

2 

8 

10 

2 


(1) 

(2) 

(3) 

98 d 

2 

6 

10 

2 

6 

{U 

(2) 

O) 

99 E 

2 

6 

10 

2 

6 

(11 

(2) 

(10) 

100 Fm 

2 

6 

10 

2 

8 

(1) 

(2) 

(11) 

101 Mv 

102 

2 

6 

10 

2 

8 

(1) 

(2) 

(12) 


Closed shells are not repeated id the succeeding parts of the Table, 
parentheses are conjectural. 


Numbers in 
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quantum electrons, of which 18 have already been placed, leaving 14 
rare earths. This result, found by means of what is in a sense numerical 
mystics, 1 has proved to be of great practical importance — it led to the 
discovery of element 72, which was named H afnium , after Copen- 
hagen, where Bohr resides. While this element was formerly looked 
for among the rare earths, this suggests that it is homologous with Zr, 
and, as a matter of fact, the zirconium minerals contain halftiium in 
considerable quantity. As in the preceding periods, the filling in of 
the 5 d group is resumed with Hf and completed at Pt. Au, like Cs, has 
again a 6s electron, and in the following elements as far as Rn (radium 
emanation) the rare gas configuration, consisting of s and p electrons, 
is again built up, so that the seven-quantum electrons begin with the 
following row. The subsequent insertion of the 6 d electrons is resumed 
with Ac. The construction of the five- and six-quantum electron shells is 
not completed, since the periodic table stops at element 92. The reason 
for this termination is undoubtedly that the heavy nuclei are unstable, 
since all elements from 84 onward are radioactive. The “transuranic” 
elements 93 to 98 have recently been produced artificially in the United 
States. As to the electron shells of these elements, the 6 f shell appears to 
begin with Ac, so that the atoms from this point onward form a new 
“ rare earth ” group that has been nan^d. the actinide series. 

The number of electrons of the saihe quantum number n contained 
in a completed shell follows from a simple but far-reaching rule dis- 
covered by Pauli.* If a certain direction is marked by means of an 
external magnetic field, the state of a spinning electron is determined 
by four quantities: major axis, eccentricity, and orientation of the 
plane of the orbit and orientation of the axis of spin. These are charac- 
terized by the quantum numbers n, l — k — 1, and the projections 
n%i and m t . Now the Pauli principle states that each combination of 
the four quantum numbers can occur but once in any atom, i.e. in 
no atom can there be two electron orbits for which all four quantities 
agree. If we remember that m t can have the values +1/2 and —1/2, 
we can dispense with the spin number by saying that in any atom there 
are at most two electrons for which n, l and m, agree. For a quantum 
number l = 1 the projections are m t — 1, 0, — 1; for l — 2, the possible 
projections are 2, 1, 0, — 1, — 2; in general, 25 + 1 different values of 
m x are possible. Hence, on account of the fact that the number of 
possibilities is doubled by including the spin, there are 2 s electrons, 6 
p electrons, 10 d electrons, &c. Since l assumes all values from 0 to 
n — * 1, the number of n-quantum electrons obtained by summation 
over all possible values of l is given by 

N n +"2 2(25 + 1) = 2a 2 , .... (76) 

* W. Pauli, Zeitschr. f. Phye., 81, p. 765 (1925). 
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i.e. exactly the relation given by what we referred to above as numerical 
mysticism! 

Despite the fact that the Pauli principle offers a simple formula 
for the number of electrons in a completed shell, the problem of the 
periodic system cannot be considered fully solved until it can be 
shown why the construction of the three-quantum orbits, for example, 
is not completed in one step but is resumed at Z — 21. Attempts to 
solve this as a problem in atomic, mechanics would lead to the hope- 
less task of computing the energy of each configuration. Fermi made 
a start toward a solution by treating the electrons in the shells as an 
electron gas. They are then subject to the Fermi-Dirac statistics 
(p. 631), and in this way it was found possible to give a very satis- 
factory, answer to the above question. 

12. Elements of the Theory of Band Spectra. 

Spectroscopists have distinguished for a long time between line 
spectra, which are due to free atoms, and band spectra, in which a mul- 
titude of lines — sometimes unresolved — present the appearance of a 
grooved or fluted surface. Even before the development of the theory 
it was surmised that band spectra are to be attributed to molecules. 
The theory of these extremely diversified spectra is exceptionally 
complicated and extensive, but offers an unexpected insight into the 
structure of molecules. However, only an outline of the theory can be 
reproduced here. 

The complex nature of band spectra is due to the fact that a mole- 
cule can contain and lose or gain energy not only in the form of elec- 
tronic energy but also as energy of rotation and internal vibration. We 
confine our attention to the theory of spectra of diatomic molecule^ 
among emission spectra these occur most frequently. Let us represent 
a diatomic molecule by a dumb-bell model. In addition to the energy 
of the electrons, the energy of this system can consist of rotation 
about an axis normal to the figwe axis and oscillation of both nuclei 
along the figure axis. 

Let us first consider pure rotation spectra, i.e. spectra resulting 
when only the state of rotation of the molecule changes. The quantum 
states of a rotator of moment of inertia I are obtained as follows. 
Since there is but one moment of inertia I normal to the figure axis, 


E = E^ = \l<f>\ (77) 

i.e. p+=l4 (78) 

If <f> is constant, the quantum condition is 

j>P4,d4 = = mh, i.e. <£ = |^. . . (79) 
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Hence the energy ia E m = (80) 


On the classical theory, the rotation of a molecule can result in radia- 
tion only if the molecule is polar, i.e. when the centres of gravity of 
the positive and negative charges do not coincide, in which case the 
molecule has a positive and a negative pole. In this case the system 
represents a combination of two Hertzian oscillators, for which the 
results of p. 341 hold. This is the case, for example, for the halogen 
hydrides, but not for the symmetric molecules N z , 0 2 , &c. According 
to the Correspondence Principle, then, pure rotation band spectra 
are possible only with polar molecules. It follows, further, from the 
uniform simple rotation (absence of overtones) that m can change only 
by one unit, so that the transition m + 1 -> m takes place in emission. 
For the emitted frequency this gives 

v “ 8^7 C(w + 1)8 ~ m2] = 4^Z (m + *>• * < 81 > 


i.e. there is a sequenoe of lines which are 
scale,* and whose separation is 


Av = 


h 

4tt 2 /‘ 


equidistant 


the frequency 


(82) 


Substituting this value in (81), we obtain a relation between the fre- 
quencies and their differences which contains only observable quan- 
tities: 

v = A v(m + l) (83) 

Remarkably enough, experience shows that instead of this relation, 
we must write 

v = A v.m (83') 

This form would be obtained by giving m half-integral values, but no 
reason for doing so is indicated. The same result is obtained by re- 
placing wt 2 in (81) by m(m + 1), as we did in the case of the g formula, 
for then we have 

" = 8 Jij t(» + D (» + 2) - »(» + 1)] = -£q (» + 1), (83") 
which means simply an increase of the current number by one, and 

* It should be noticed that these formulae, as they stand, are written in terms of 
actual frequency; it i3 easy to convort to wave numbers by dividing by e if desired. 
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thus agrees with equation (83'). If we again reduce the current number 
by one and once more call it m, we get the final form 


hm 


V ~ krfiV 


(83'") 


The newer quantum theory yields precisely this expression for the 
energy of a rotator. Thus we see once more that the theory based on 
models agrees with observation only after introducing a modification. 

We now go a step further and assume that the state of oscillation 
also experiences, a change. The resulting bands are called rotation- 
mbralion bands. While the pure rotation bands lie in the far infra-red 
— in the neighbourhood of 30 ft — the rotation-vibration bands fall in 
the near infra red, in the vicinity of 5 /*. Both types of band are 
observed in absorption, but it is more convenient to develop the theory 
for emission. The absorption lines coincide with the emission lines, 
for a large number of molecules already possess several quanta of 
rotational energy on account of the thermal motion, so that the initial 
levels of a large number of molecules already contain several rotational 
quanta, in contrast with the atoms, where the thermal energy is not 
sufficient to provide an appreciable number of excited atoms. The 
absorption lines are obtained by interchanging the initial and final 
states. 

By p. 94, the quantum states of a harmonic oscillator in which 
a mass M vibrates about a position of equilibrium are given by 

< M > 


In our model the nuclei vibrate in opposite phase, so that the centre 
of gravity remains at rest. The calculation of the characteristic vibra- 
tion of this svBtem is a problem quite similar to that on p. 651 , where 
the motion of the atomic nucleus was taken into account. Here, too, 
we find (cf. Ex. 43, p. 110) that it is only necessary to replace the mass 
of the particle by the reduced ‘mass p., where 


i_J_ , _L 
p ~ m x m; 


(85) 


Hence the energy states of a molecule which are due to oscillation and 
rotation are ' **»(« + 1) /am 

E n , = nhv 0 H I 86 ) 


As long as the binding force on the nuclei is strictly harmonic, the 
Correspondence Principle again gives A n — +1 as the selection rule 
for n. Nevertheless, on account of the fact that the restoring force is 
not perfectly harmonic, higher quantum transitions also occur; these 
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are referred to as overtone bands. The frequencies of the lines of the 
fundamental band are, according to (83'"), 


* = r 0 ± 7 


(87) 


Here the sign of the rotational part may be either positive or negative. 
Since the oscillational contribution is by far the greater (it displaces 
the system toward the near infra-red), it may happen, in emission, 
that a transition n 4- 1 -> n occurs together with a rotational change 
m — 1 m without any change in sign of the transformed energy. 
The lines thus appear on both sides of the null line v 0 , which is itself 
absent. The quantum mechanical treatment of the oscillator also leads 
to half quantum number n, but this does not alter (87). 

If, in addition to the energy transformations already considered, 
the state of the electron shells of the molecule also changes, the emitted 
radiation falls in the visible region or in the ultra-violet. This repre- 
sents the usual band spectra which have the appearance of illuminated 
columns. The essentially new feature contributed by the electron 
transition' is a change in the moment of inertia and in the interatomic 
bond by penetration of an electron into the electron shell which is 
responsible for the binding force between the atoms. We must there- 
fore distinguish between the moment of inertia in the initial state and 
that in the final state, and also between frequencies of oscillation in 
the two states. Denoting the electronic energy by E el , the energies 
of these two states are 


Ef = Er^ -j- n t hv t -J- 


Ef = ■Eg'P -fr tifhvf -f- 


hhrii (m { + 1 ) 

8 ^ 7 , * 

-(-1) 
8t t*1, ' 


(88) 


These constitute the quantum-mechanical expressions for the energy 
states, insofar as the electron shells possess no resultant spin. If the 
latter does not vanish, the total spin j of the molecule is an integral 
multiple of h/2n and, according to the kind of coupling of the indi- 
vidual spin vectors, there will be many small differences which cannot 
be entered into here. In order to keep the formulas manageable in 
such cases, we make use of the fact, mentioned above, that except for 
a small change in the constant term we get the same result by replacing 
the “ quantum-mechanical square ” x(x -}- 1) by (x -j- \) 2 . Accord- 
ingly we write A 2 ' 1 

’ * • < 88 '> 

E f = + n t hvf 4- 87 " r (jf + 1) 2 - 
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The Correspondence Principle states that transitions to j f =j originate 
in 3t =j + 1 and j t =j — 1. However, in case of a non-vanishing 
electron spin the transition j -> j is also permitted. If we include 
this case, the following formulae result for the three branches of 
a band: 

E- (positive) Branch: v — A + 2B(j -f- 1) + C(j + l) 2 , 'l 
P- (negative) Branch: v = A — 2B Ij + Cj 2 , 1 . (89) 

Q- (zero) Branch: v — A + Cj + Cj 2 , J 

where the abbreviations A, B and C are defined as follows: 



As mentioned, the Q-Branch is absent if the spin of the electron shells 
is zero. The designations R, P and Q are arbitrary and have an historic 
origin. 

The positions of individual lines are represented very vividly by 
a diagram due to Fortrat (fig. 6). In a rectangular system of co- 
ordinates take v as abscissa and m as ordinate. For the present assume 
m to be continuously variable. Then we obtain three parabolic arcs 
for the three branches. But only the points whose ordinates are whole 
numbers correspond to spectral lines. We must therefore cut these 
curves by the fines m — 1, 2, 3, . . . and project the points of inter- 
section on to the v-axis. It is then easy to see the way the head of the 
band originates; it is a point toward which the fines crowd together, 
but is not a mathematical accumulation point like a series limi t This 
representation also suggests why a band has the superficial appearance 
of being a spatial configuration. 

If we now consider the bands which belong to various values of 
n ( and n f , we must remember that the harmonic binding is merely 
a first approximation to the power series development of the actual 
law of force. As a result, the selection rule for n is not obeyed. The 
change in the energy levels caused by the higher powers in the law 
of force will not be taken into account here. The contribution of the 
change in the state of oscillation then becomes 


h ^ — % v, = (rii — n,) ^ + n,(v t — v,). . (91 ) 
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The second form yields a subdivision of the doubly infinite manifold if 
the oscillation frequency is not changed very much by the electron 
transition. In that case the second term is small compared with the 
first and we can collect all partial bands corresponding to the same 
change of » into one band group, while the entire band system is ob- 



tained by admitting all possible values of (n t — n f ). All lines belonging 
to the same band system correspond to the same electron transition. 
Thus the event which gives rise to a single line in atomic spectra is 
the cause of a band system consisting of many thousands of lines. 
However, in this connexion it must be remembered that only one 
of the many lines is emitted or absorbed as a quantum hv in each 
elementary process. If the electronic states of molecules were as 
numerous as those of atoms, we would obtain a maze of many band 
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systems. Fortunately the number of possible energy states of the 
electron shells of molecules is, evidently, considerably smaller (exoept 
for H 2 and He 2 ), so that a molecule generally possesses only a few 
band systems. The reason is that more violent disturbances of the 
electron shell which effects the bond lead to the decomposition of the 
molecule. 



CHAPTER XL 

Critical Atomic Theory: The New Quantum Mechanics 

1. The Wave and Particle Aspects of Light. Compton Effect and Raman 
Effect. 

Although the phenomena of interference appear to prove that 
light is of the nature of waves, we have always treated emission and 
absorption as though the light consisted of individual energy centres 
(quanta) of amount hv which could not be subdivided. This quantuqa- 
like energy transfer has been rigorously demonstrated by an enormous 
number of experiments. Attempts to introduce this process into the 
wave picture by assuming that the atom could store up the electro- 
magnetic energy incident on it until the quantity hv is reached, 
lead to quite impossible accumulation times for X-rays, so that 
this procedure was soon abandoned. Attempts to give the quanta 
themselves such properties that interference phenomena would result 
without waves proved equally unfruitful. There remains no other 
course but to look upon waves and quanta as two observable aspects 
of a single phenomenon whose true nature cannot be described in 
terms of any mechanical model. A crude representation of this dual 
property of light by assuming certain points on the wave front to be 
the seat of the energy also proved impossible. In no way can we 
spatially localize the quanta in a light wave. The true source of these 
difficulties will become apparent in the following section. We shall 
begin with the' discussion of a phenomenon which more than any 
other brings into prominence the corpuscular nature of light, and 
which was therefore considered for some time to be the exprn- 
mentum cruris for the quantum nature of light. This is the Compton 
effect.* 

If monochromatic X-radiation is allowed to fall on a scattering 
substance — e.g. a block of paraffin — and if the laterally scattered 
radiation is examined spectroscopically, the original line is found to 
have a companion on its long-wave side, the separation of the two 
depending upon the angle between the primary ray and the direction 
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of scattering (direction of observation). The position of the secon- 
dary line can be computed in a frankly naive way by considering the 
scattering process to be the collision of a light quantum (or photon) 
with an electron. The energy delivered to the electron according to 
the elementary laws of impact must result in a decrease of energy — 
and hence a decrease in frequency — of the primary ray. We may look 
upon the electron as being free if it is one of the outer electrons of an 
atom, for in the X-ray region the quanta have energies of several 
tens of thousands of eleotron volts, while the work of removal of an 
outer electron is of the order of but a few electron volts. This, in fact, 
nw is the reason that the pheno- 

menon is observed only in the 
X-ray region. We now write 
the equations of conservation 
of energy and conservation 
of momentum for this colli- 
sion process. The latter we 
shall decompose into its com- 
ponent equations for the 
direction of the primary ray 
and the direction normal to 
it. Denote by v' the fre- 
quency of the scattered light, by m the mass of the electron, by </> the 
angle between the primary ray and the scattered ray, and by 8 the 
angle between the primary ray and the velocity vector of the recoil 
electron (cf. fig. 1). By the principle of the inertia of energy 
(p. 258), the “ mass ” of the photon is 



(1) 


so that its momentum is 


We then have the energy equation 

h\>= hv’ -f- \mi?, 

and the two component momentum equations 

hv hv' , . 

— = — cos 6 + mv cosS 
c c 

0 = — sin tf> — mv sin 8 


(S’) 

(4) 


Since we are interested only in the frequency v of the scattered light 
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as a function of tlie angle of scattering <j>, we eliminate 8 from the 
two momentum equations and obtain 

m 2 u 2 — ^ l*' 2 + f'' 2 — 2w' coacf,). ... (5) 

For light of not too short wave-length we have v — v’ «C v (this 
approximation is not justified for y-rays), so that we may replace v 
by v in the term mv 2 / 2 of the energy equation. From (5) we then have 

( 6 ) 

whence Av — v — v = — sin 2 ^ (7) 

me 2 2 v 


This equation may be put in more elegant form by introducing the 
change in wave-length 

a ■» cAv 
AA = --^ 

in place of the frequency change. Then there results 

' 2 h , 


= 2A 0 sin 2 f. 


(7') 


The wave-length Aq introduced in this way is the wave-length of 
radiation whose photon has an inertial mass equal to that of the elec- 
tron, for then we have 


m, = 


he 

Aoc 2 


hmc 

ch 


= m. 


(8) 


The numerical value of A„ is 24 X 10“ u cm., so that this wave-length 
falls in the y-ray region. Despite the fact that we neglected the rela- 
tivistic variability of mass in deducing equation (7), it is strictly cor- 
rect, for exactly the same final result is obtained when this effect is 
taken into consideration. 

In examining the scattering of visible light by liquids or gases, 
Raman * discovered an effect which is superficially similar to the 
Compton effect, but is fundamentally different from the theoretical 
standpoint. Here again displaced lines are found near the primary 
line, but their displacement does not depend on the direction of 
scattering, as it does in the Compton effect. This phenomenon, 
too, may be interpreted, in its larger features, as a division of 
the photon into smaller quanta in the scattering process. Part of 
the energy of the primary quantum is given to the scattering mole-, 
cule as energy of vibration or of rotation, or else some of the molecular 

* C. V. Raman, Ind. Jour. Phya., 2, March (1928). See also Nature, 121, p. 601 
(1928); ibid, 122, p. 12 (1828). 
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energy is added to the primary photon. The equation of energy is 
hv — hv +nhv Q , (9) 

where v 0 must be one of the characteristic frequencies of the molecule, 
as determined from its band spectra. The details of this phenomenon 
require a more exact theory, the basic features of which will be given 
in Chap. XLI, § 3 (p. 716). 

2. The Inexactness of Atomic Observations. 

Heisenberg was the first to recognize the fact that the motions of 
electrons in the atom are, as a matter of essential principle, incapable 
of being verified experimentally. In celestial mechanics we can check 
our calculations by computing the future configuration of the heavenly 
bodies from their instantaneous positions and velocities and then 
comparing the results with. observation. For the electron system of 
an atom, however, the rigorous determinism of classical mechanics is 
lacking — we cannot ascertain the necessary initial conditions. This is 
inherently impossible, quite apart from technical difficulties, the 
reason being that any means of observation by which atomic pro- 
cesses are conveyed to our perceptions causes a disturbance of the 
electron system of the atom. ' 

If we allow our imagination free rein and disregard all technical 
difficulties, the first thing we might try is to determine the instan- 
taneous positions of the electrons with the aid of a microscope. The 
limit of resolution of this instrument is given by A/2 A, where A is the 
numerical aperture, which we take equal to unity — i.e. we take the 
angular opening of the incident pencil to be w/2. If we now demand 
that the position of the electron is to be determined to within at least 
1 per cent of the radius of the atom (10~ 8 cm.), we must employ 
light whose wave-length is of the order of 10 -u cm., i.e. 1 X.U. The 
relative change in wave-length caused by the Compton effect is ex- 
tremely large for such short waves, as one sees from the fact that for 
the characteristic wave-length 24 X.U. and a scattering angle of tt/2 
we already have AA — A. From (6) the momentum of the recoil 
electron is approximately’' 



This is the change in momentum of the electron resulting from its 
observation by means of y-rays. Since we assume a range » angle 
of from 0 to m/2, corresponding to the aperture of the microscope, the 
momentum is determined only to the quantity 
% 

Ap = mv (11) 

• Putting v' v. 
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since all the light contributing to the image must be scattered by the 
object and so must participate in the Compton effect. We may measure 
the velocity by analysing the scattered light spectrally, determining 
the velocity of the scattering particle from the change in wave-length 
caused by the Doppler effect. But we do not know how to take into 
account the velocity change due to the Compton effect. The original 
velocity might have been changed by the act of observation, in .which 
case the error of observation lies in the range A p. If we use longer 
waves in order to reduce the Compton effect we make the determination 
of position uncertain. On account of diffraction, the error A q in the 
position determination is of the order of magnitude of A, hence the 
product of the error in the position co-ordinate by that in the momen- 
tum co-ordinate is 

ApAq A . (12) 

This equation represents one of the most important principles in 
theoretical physics; it shows us the limits of applicability of mechani- 
cal causality. It has pointed out new paths in atomic physics: no 
longer is it a task of the theory to disclose subatomic processes; rather, 
the theory must — without considering phenomena whieh are inacces- 
sible to observation — create a mechanics which reproduces the 
observable quantities — the spectral lines and the energy levels — from 
a model, of which we are required to know only the component parts 
and the acting forces but not the mechanical constants of integration. 
From this point of view we can understand, for example, why attempts 
to explain chemical bonds by introducing phase relationships between 
the electron orbits of the constituents were unsuccessful. 

The indeterminacy relation, which holds for any pair of canoni- 
cally conjugate variables, becomes of especial importance when applied 
to energy and time:* 

AE.At^h (13) 

Since the quantum energy states are usually thought of as being 
absolutely sharp, this implies that we can say nothing regarding the 
time an atom remains in such a state. Actually, it is found that after 
eliminating all broadening influences like the Doppler effect, a spectral 
line still has a finite width, so that the energy levels must be considered 
to have a finite — although very small — breadth. From this we deduce 
a time interval of the order of 10 -8 sec., which is interpreted as the 
time the state persists. 

These considerations throw new light on the failure of all 
attempts to treat theoretically the process of emission itself on the 
basis of the concepts developed in the preceding chapter. Our ordinary 

* In order to see that these quantities constitute a pair of canonically conjugate 
variables, consider the second of Hamilton’s Equations on p. 122. If we put H — B — 
p it follows that dqltU — 1, i.e. 3 — f. 
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ideas would lead us to require of such a theory that it should furnish 
an equation which gives the amount of energy radiated at each instant, 
i.e. E =/(<)• However, such a relation could not be tested experi- 
mentally. If, for example, we divide the time of emission, which is 
empirically known to be of the order of 10~ 8 sec., into 10 7 intervals — 
a continuous function must permit of any arbitrarily small division 
of the variables — and if we require that it be possible to give the 
energy content of the atom for each of these intervals, we find that 
we are asking the impossible, for the indeterminacy relation gives fqr 
the error in the energy 


But the work of excitation of, say, the line A2537 is about 5 electron 
volts, or 


= 8x 10~ la erg. 


Comparison of the results shows that the error in the energy is about 
as large as the entire amount of energy available. Parenthetically, it 
might be remarked that for the radiation from a macroscopic antenna, 
on the other hand, neither the error in the time nor that in the energy 
is important, on account of the magnitude of these quantities, so that 
the description of the process of emission by means of a Hertzian 
oscillator adequately represents the facts. 

If we extend the indeterminacy relation, which was originally 
proved for electrons, to all the elementary particles — electrons, nuclei, 
photons — for which analogous hypothetical experiments can be devised, 
we see at once why it is experimentally impossible to localize the 
quantum in a monochromatic wave train of frequency v. In an 
infinitely long wave train — in this case only is the wave mono- 
chromatic — the momentum hvjc of the photons can be given exactly 
by means of spectroscopic determination of the wave-length, but this 
makes the position co-ordinates completely indeterminate. On the 
other hand, if the time at which a photon passes a given point in space 
is confined between certain limits by using a wave train of finite length, 
the light is then no longer monochromatic, for it can be represented 
by a band of infinitely long trains of Fourier waves (cf. p. 57). In 
the curve which gives the intensity as a function of the frequency, 
the distance between the two frequencies for which the intensity has 
half its maximum value is given by 


Av = 


JL 

At’ 


(14) 


If we 


use the hv law to connect the frequency with the energy 
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E of the photons we obtain the indeterminacy relation (13) from the 
equation (14) which is derived solely from considerations of wave 
kinematics. Thus the light waves accomplish for the photons exactly 
what we need to confirm the Heisenberg indeterminacy. It then 
becomes quite natural to try to find a similar correlation of waves 
with the motion of electrons. The idea of “ matter waves ” was first 
propounded by L. de Broglie. This will be considered in detail in the 
following sections. 

3. Matter Waves. 

For light, the paths of the photons — the rays — are the orthogonal 
trajectories of the wave surfaces. As long as all apertures and screens 
are large compared with the' wave-length of the light we can neglect 
diffraction phenomena and can compute the paths of the rays by 
means of Fermat’s Principle 

# • • • • <16) 

(« is the phase velocity) if we have given the index of refraction as a 
function of position. But as soon as we pass to objects whose size is 
of the order of magnitude of a light wave, simple geometrical optics is 
no longer applicable and the phenomena connected with diffraction 
can no longer be described in terms of light rays. We must in fact 
consider the wave disturbance which exists at a given surface and 
compute the subsequent spread of this disturbance (cf. p. 376). 

On the other hand, a similar extremal principle — that of Hamilton 
(p. 123) — holds for the paths of particles which obey the classical 
mechanics. The analogy with optics becomes particularly evident if 
we restrict the comparison orbits to those of the same total energy. 
This is equivalent to passing from Hamilton’s Principle to that of 
Maupertuis. By Hamilton’s Principle we have for a particle, 

0 = 8 £ {2T — E)dt = S jT* 1 2 Tdt = 8 £ 

We can now introduce the length of path as the variable of integration 
in place of the time: 

0=8 f mv^ dt — 8 f P mvds, 

Jt, at Jp, 

or since m is constant, 

S f Pl vds = 0 (16) 

J p. 

Comparing (15) and (16), we see that if we wish to associate waves 
with particles moving according to the laws of mechanics in the same 
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way as light waves are associated with light rays, we must set the phase 
velocity u proportional to 1/v in order that Fermat’s Principle shall 
be valid for the matter waves. Again, if we denote the potential energy 
by U, 

U. • • 07) 


If we speak of a wave function we must introduce a frequency. It is 
natural to dojihis by means of the old hv relation: 


0 = hv, u = 


■* /hv 


(18) 


This means that the phase velocity depends upon v; hence there 
exists dispersion of oup tentatively introduced waves. 

Now in a dispersive medium it is necessary to distinguish between 
the phase velocity u and the. group velocity u„. The latter is the 
velocity which we can measure directly by means of a recording 
mechanism, while the individual extrema in a wave train of constant 
amplitude are not distinguishable. For a material particle, on the 
other hand, we can, of course, measure the actual velocity. Hence 
we demand that the group velocity of the waves associated with a 
particle shall coincide with the velocity of the particle. Using the 
relation between group velocity and phase velocity given on p. 66, 
and making u a = t>, we have 



Employing the hv law and integrating, 



We can therefore state a dispersion law which fulfils the requirements 
1/m cc v and u g = v if we replace the constant C in equation (18) by 

*p/V(2»).* 

We then have 


hv 

V 2m (hv — U) 


(21) 


* In equation ( 18 ), C can still contain E and hence hv, sinoe all comparison paths 
are assumed to have the same energy. 
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The wave-length of the matter waves is given by 
A=“ = 


In a region free of force, 


K = h = — 
V2mE mv 


( 22 ) 


(22') 


Passing from a region where 17 — 0 to one where the potential V 
exists corresponds to entering a region whose index of refraction is 

lE ~° (23) 


-V 1 


If U is a function of position, we have a medium of variable index of 
refraction in which the orthogonal trajectories of the wave surfaces — 
i.e. the orbits — are curved. 

If, for example, we take U to be the potential of gravity and con- 
struct, according to geometrical optics, the rays of equal E passing 
through a fixed point, we obtain all parabolic trajectories having the 
same initial velocity. 

Up to this point the analogy between the mechanics of a particle 
and geometrical optics may seem to be mere idle speculation. If it 
is to be more than that, it must be possible to detect phenomena 
which justify the introduction of associated wave fields— i.e. inter- 
ference and diffraction phenomena — for particles, as was done in 
optics to prove the wave nature of light. If the “ de Broglie wave- 
length ” of an electron which has traversed a difference of potential 
V is calculated by (22'), we obtain the simple relation 



where V iB measured in volts and A is given in A.U. This shows that 
electrons whose velocity is several hundred or several thousand volts 
are associated with wave-lengths corresponding to those of X-rays. 
Hence we may expect the same diffraction and interference pheno- 
mena as for X-rays; in fact, Davisson, Kunsman and Germer had 
previously found selective maxima when a beam of electrons was 
reflected from the surface of a nickel crystal at given glancing angles 
corresponding to those of X-rays of the same wave-length.* The 
detection of the “ matter waves ” is even simpler by Hie Debye- 
Scherrer method. It is necessary only to send a narrow ray of electrons 
of uniform velocity through a thin metal foil; diffraction by the 

* See C. Davisson and L. Germer, Nature, 119, p. S68 (1927), Phy». Rev., 30, p. 70S 
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variously oriented crystals gives a ring diagram which is as perfect as 
any obtained by means of X-rays. At those places where the wave 
computation indicates destructive interference no electrons are found, 
while at maxima we observe a large number of electrons. 

Stem has even observed matter wave interference by reflecting a 
beam of helium atoms from the surface of a crystal.* According to 
equation (22'), the greater mass in this case yields a wave-length 7300 
times as short, for the same velocity. But this is compensated by the 
fact that the velocity (of thermal agitation) is much smaller in this 
instance, so that the de Broglie wave-lengths again fall in the region 
which is readily measured by means of crystals. 

If we wish to account for the more detailed phenomena of dif- 
fraction in optics, we must turn to the wave differential equation. 
Similarly, in atomic mechanics, where we also deal with quantities com- 
parable with the wave-length, we must use an analogous wave equation 


AT — 


2_a^T 

M 2 0f 2 ’ 


(25) 


where 


= a/ 2 m { kv — U) V2 m(E— U)' 


The meaning of the quantity T which satisfies the wave equation is 
still undetermined. We require of T only that its square — or, if it is 
complex, its norm TT — be a measure of the intensity of, for example, 
an electron beam — just as in any optical theory the square of the 
quantity which satisfies the wave equation is a measure of the light 
intensity, without necessarily requiring that it be a component of the 
electric vector. 

As is always done in the wave theory, we take a simple periodic 
function to represent the dependence on time: 


T = tfie Mrt — $ e 2 " iE,h - t (27) 


and obtain for the variation of tfi with position the fundamental 
Schrodinger equation f 

ty+~^(E~U)f = 0 (28) 

This equation has the same significance in atomic mechanics as has 
the Hamilton- Jacobi equation 

m 

in celestial mechanics. 


* See F. Knauer and O. Stem, Zeilsckr.f. Phys., 53, 11-12, p. 779 (1929). 
t E. Sohrddinger, Phys. Rev., 28, p. 1049 (1926); Collected Papers on Wave Mechanics, 
pp. 2, 77. 
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4. The Proper Values of the Wave Equation. 

The wave function Y governs the motion of the electrons in the 
same way as light waves determine the motion of the photons — 
wherever Y vanishes there are no electrons. If, then, we seek a solu- 
tion which is to represent the motion of an electron in an atom, Y 
must certainly vanish at infinity, for the bound electrons are all con- 
fined to the finite region. In addition, we must demand that any 
physically useful solution is to be a continuous and a single-valued 
function of position in the finite region. Simple as these conditions 
seem, they are, nevertheless, sufficient to determine the characteristic 
values which in the old quantum mechanics were singled out from 
among the many mechanically possible solutions only by imposing 
an auxiliary condition. It may be shown that the condition of single- 
valuedness is identical with the old quantum conditions for a. suffi- 
ciently large periodic orbit. To demonstrate this we shall confine our 
attention to systems with one degree of freedom. As a matter of fact, 
the extension to more degrees of freedom — and hence, in general, to 
multiply-periodic systems — does not present the slightest difficulty. 
Let us consider a closed orbit to be the orthogonal trajectory of wave 
surfaces. Then if we move along an orbit, Y must return to its former 
value when we -again reach the starting-point, in harmony with our 
requirement that Y is to be single valued. This means that the phase 
of the wave motion must have increased by an integral multiple of 2 -it, 
and this in turn implies that a whole number of wave-lengths must be 
contained in the path. In general, the wave-length may vary from 
place to place if the index of refraction is variable. We may therefore 
express the condition of single-valuedness as follows: 

<f ^ = (f i V2 m(E-V) dq=n. ... (30) 
But according to the Hamilton-Jacobi equation (p. 127), we have 

HW‘ +v=e - “• I = < 3i > 

We thus obtain the old quantum condition 

£pdq=nh . . (32) 

It might seem, at first, that this merely secures a more reasonable 
foundation for the quantum conditions without altering the result 
in the least. At the same time it must be remembered that the 
single-value condition is given a significant meaning by equation (30) 
only if the change in wave-length is so gradual that the whole orbit 
can be divided into intervals dq, within which several equal wave- 
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lengths lie. Since this condition is by no means always satisfied, it is 
quite possible that a strict calculation based upon the wave equation 
may lead to somewhat different results. 

The theory of linear differential equations shows that the con- 
dition of single-valuedness, or the boundary conditions, can be satis- 
fied only for certain values of the parameter E. These are called the 
proper values, or characteristic values , * of the differential equation 
(of. the simple cases discussed on p. 186 et seq.). Our chief task, 
therefore, is the determination of the proper values for a given 
mechanical problem; these are, by their very nature, the selected 
energy levels. The corresponding characteristic functions, or proper 
functions, on the other hand, are only of secondary importance. We 
shall illustrate the process by two examples — the hydrogen atom and 
rotational band spectra. 

5. The Hydrogen Atom. 

Since the potential energy 

U=-- r (33) 


depends only upon r, the co-ordinates appropriate to the 
problem are spatial polar co-ordinates. Denoting the electron mass 
by m 0 , the Schrodinger equation (28) (p. 696) becomes (cf. p. 268) 


3V , 

3 r 2 '*' 


? JL 

r dr ^r* lain 


s( 8ine ls) 

, 8n 2 m 0 / 

+ h? V 


1 

sin 2 <? 


9*01 

m 

& 


E+-)'P = 0. 


(34) 


We try to solve this equation by a value of ip which is the product of 
separate functions of r, 0 and <f>: 

0 = fi(r)0(0)<D(0) (361 


So far as <£ is concerned, we can have only the function 

<I> = e (36) 


or a linear function of cos mf> and sinwn^, with integral values of m on 
account of the single-value condition. Whatever function we take, 
we have always 

(37) 

* Gar. Eigenwerte, Eigenfunktionen ; Fr., constantes fondamerttales, /(motions fonda- 
mentales. 
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Eliminating <X>, we obtain the following equation which depends only 
upon r and 6: 


-_ILL L(- o d ®\ m20 ) 

“ © (sin<? d6 \ de ) sin 2 #)’ 


(38) 


which we have written in such a way that the left member contains 
only functions of r, the right member only functions of 0, so that each 
member must be a constant, say o, so that the variables are separated. 
First we examine the equation involving 6: 

s») 9 - a • <39) 

This equation is familiar as the differential equation of spherical har- 
monics. For details, see the Mathematical Addendum, § 4 (p. 819). 
We substitute . , 

cos 0 — x, (40) 

and write in place of ©(cos 6) the more common symbol y{x ) ; there results 

<‘-^S- 2 *2+(— • <«> 

As is shown in the Theory of Spherical Harmonics, this equation has 
solutions which satisfy the conditions of single-valuedness and con- 
tinuity if a is of the form 


: (Hw)(H m + 1)» 


(42) 


where k and m are whole numbers. Wo set k -f- m = Z and call the 
solutions corresponding to a — 1(1 + 1) spherical harmonics of Zth 
order. There are Z 4- 1 different functions of this kind, since m can 
assume all values from 0 to Z. For m — 0 we obtain the so-called 
surface zonal harmonics or Legendre polynomials. These are simple 
polynomials of degree Z, and are most readily obtained as the coefficients 
of the powers of r in the expansion of the expression 
F{r, x) = (1 — 2rx -f r 2 )- 1 ' 2 . 

This gives 


P 0 -l 


P^ix? - f* 
n 35 , 15 a , . 


(43) 


Pz — I — $ P & — '■& — x 3 ~f ^ x. See. 
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As one may easily verify, the polymonial of degree l is a solution of 
that differential equation for which the value of a is 1(1 + 1); for 
example, P 6 corresponds to a = 30. When the value of m is different 
from zero we again obtain polynomials as solutions. These are called 
the associated harmonics, and are related to the surface zonal har- 
monics by the equation 

fimp 

m*)=( 1 — s 2 )”*' 2 ^ (44) 


Since the 1th derivative of P, is a constant, the spherical harmonic 
belonging to m—l, which corresponds to the highest azimuthal sub- 
division, is 

P, l = K(l — x*)'l* = JT(sin0)». . . . (45) 


We have still to determine R as a function of the radius r. If we sub- 
stitute the value a — 1(1+ 1) in the separated equation (38) (p. 699), 
the differential equation for R becomes 

^ + ??+( 8 -^ + ^_‘l P>)r-0. (46) 

dr * r dr \ nr n*r r f 

We introduce the abbreviations 


A, -1(1+1) 

47r 2 m 0 e 2 _ B 


• • • (47) 


and obtain 


d*R 2 dR 
dr* dr ‘ 






(48) 


The nature of the solutions depends essentially on the sign of A, and 
hence of E, corresponding to the case of an elliptic or a hyperbolic orbit, 
as may be seen from the behaviour of the equation for large values of 
r. In this case the terms not containing 1/r dominate, so that the 
equation may be replaced approximately by 

(«) 


According to the sign of A we have either an oscillating function as 
a solution or one which tends to zero or infinity exponentially. 

Consider first the case when A, and therefore E, is negative. It 
turns out that this corresponds to the case of an elliptic orbit. Set 


A = - 


(50) 
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where A must have the dimensions of r~ 2 , in order to have AR of the 
same dimensions as dPR/dr 2 . In addition, we set the dimensionless 


L = P 

r 0 2 ' 


(51) 


In this notation, the asymptotic solution is, according to (49), 

R = Ke -o' 2 (52) 

The positive sign of the exponent is excluded if the solution is to have 
a physical meaning. We try the solution 

R^e-'IHip) (53) 

for the finite region and thus obtain the differential equation 

. + (64) 

which we again try to solve by means of a polynomial.* The theory 
of differential equations shows that the lowest power whose coefficient 
does not vanish is the 1th in our equation. If, then, we write 

v = p l w(p) = p'{a 0 +a lP +.. . a v p» + . . .), . (55) 

and use this expression and the derivatives obtained from it in equation 
(54), we obtain the recurrence formula 
[(P + 1+ 1 )(P + l) + 2(y + 1 + 1) - 1(1 + 1)]^ 

= [p + l+ 1 — rj&\a 9 . . (56) 

The series terminates with the term a p p p if 

rji^p + l+l^n, . (57) 

where n is any positive integer; i.e. by (50), if 

B 2 = —n 2 A (58) 


Hence, by (47), the possible values for the energy are 
l? — 2w * wt o e< 

‘ 


(59) 


But these are precisely the energy levels of the hydrogen atom which 
were obtained in the old quantum mechanics on the basis of the Bohr 
postulates. Corresponding to the spherical harmonic of 1th order, 
then, the lowest value of » is » = 1 + 1. This corresponds to 

* The fact that the power aeries terminates is a sufficient condition for the fulfil- 
ment of the condition of single-valnedness. That it is also a necessary condition cannot 
be demonstrated here. 
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the empirical rule of the old theory according to which it was 
necessary to count the number l from 0 to n — 1 instead of from 
1 to n. In the same way, the (21 -f 1) different values of m t which 
we interpreted as the number of integral projections of l, are here 
found to be the number of surface harmonics P{ n (x)e ±im *‘ which, by 
equation (44) (p. 700) correspond to a given numerical value of l. 
The factor 2 results from the double sign of the exponent which, 
however, gives but one value for m — 0. The proper function corre- 
sponding to the proper value E n is 

tft n = e~^phu n _ l _ l (p)P{ n {cosd)e ±im *, . . . (60) 

where the degree p of the polynomial w v is expressed in terms of n 
and l. The factor e - ^ 2 guarantees the vanishing of the ip function at 
infinity. Since we wish to compute with real <p functions, we have not 
put for A second solution, corresponding to the same proper 
value, is the same expression with sinm<£ replacing cos m<f>. 

There are » 2 different proper functions corresponding to each 
definite proper value E n , for l runs from 0 to n — I for a given value 
of n, and to every value of l there are, by the above, 2Z -j- 1 surface 
harmonics, so that the number of possible proper functions is given by 

z„ =”S 1 (2 1 + 1) = J (1 + 2n - 2 + 1) = » 2 . . (61) 

o * 

The case where several proper functions correspond to one and the 
same proper value is what we refer to as a case of degeneracy in 
celestial mechanics, since the number of independent frequencies is 
smaller than the number of degrees of freedom. For this reason we 
also refer to such cases in wave mechanics as degenerate. The degree 
of degeneracy, however, is different here — if z proper functions are 
associated with one proper value, we say the degree of degeneracy 
of the system is (z — 1). 

With regard to positive values of E, which correspond to hyper- 
bolic orbits, we may mention without proof that no discrete proper 
values exist in this case — every positive value of E is compatible 
with our requirement that the solution should be one-valued. This 
corresponds to the continuous spectra found at the series limits; 
formerly their existence was explained in an analogous way by point- 
ing out that the kinetic energy with which an electron departs from 
the atom is not subject to a quantum condition. 


6. Rotation Spectra of Diatomic Molecules. 

For simplicity we shall consider^ diatomic molecule, one of whose 
atoms is of infinite mass, the other being of mass M and at a distance 
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a from -the first. The co-ordinates appropriate to the problem are 
again spatial polar co-ordinates. We write 

a 2 M = 7, (62) 


and since there is otherwise no dependence on r and no potential energy, 
the wave equation simplifies to 


1 d 
ainffdff 


( M U) 


, i ay , 8 t me , 

+ sin 2 # d<f> 2 + h 2 * 


. (63) 


This equation holds quite generally if we take 7 to be the moment oi 
inertia about an axis through the centre of gravity normal to the 
figure axis. Comparing this equation with equation (34) (p. 698), 
we see that we again have the equation of surface harmonics, whose 
proper values are given by 


8t r 2 IE 
h 2 


= + 


_ 1(1 + l)h* 


. . (64) 


But this is precisely the form given in § 12 of the preceding chapter 
(p. 682) for the rotational energy levels of diatomic molecules; the 
old quantum theory gives this incorrectly as W/87T 2 /.* Thus the 
newer theory justifies the curious rule of the replacement of x 2 by 
x(x + 1). In general, the result is that wherever the older atom me- 
chanics yielded x 2 , the square of spin quantum numbers, these are to 
be replaced b jjc(x -j- 1), the so-call$d “ quantum-mechanical square 


7. The Physical Meaning of the Y Function. Intensity Relations for 
Spectral Lines. 

Schrodinger originally interpreted his Y function in the following 
way. The electron is not a localized charge within the atom, but 
charge and mass are “ smeared ” over a certain region. The density 
of charge is assumed to be proportional to YY, the “ norm ”, or 
square of the amplitude of the Y function. Assume first that only 
a single state of the atom, corresponding to the proper value E m , is 
developed. The corresponding Y funotion is 

Y m = tfi m e 2 ”( E ^’ h ) t (65) 

Since tf> m is real, we see that on multiplying by Y m the time factor 
disappears, so that the distribution of charge is constant in time. 
A configuration of this kind cannot radiate, in agreement with the first 
Bohr postulate. 

* Writing I instead of m. 
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For what follows it is necessary that the <p functions be ortho- 
gonal, and “ normalized to unity ”. By the latter expression we mean 
that, since there is an arbitrary multiplicative constant in the solution 
of the differential equation, we can choose it so as to make the integral 
of the square of any proper function i/r m , taken over the entire region, 
equal to unity. We thus specify that all proper functions are normalized, 

f tJdr = 1 (66) 

The orthogonal property implies that the integral over the entire 
region of the product of two proper functions belonging to different 
proper values is zero: 

J ip m tp„dr = 0 (67) 

The proof is based upon Green’s Theorem (cf. p. 270): 

J (tp m A>p„ — ip n Atp m ) dr = (j) (>p m grad ip n — tp„ grad tp m ) dS. (68) 


The proper function ip m satisfies the equation 


A<p m + ^(E m -U)ip m = 0 , . . 

. (69) 

and tp n satisfies the equation 


+ U)^ = o. . . 

. (70) 

If we multiply the first equation by i p n , the second by tp m> and subtract 
the first from the second, we obtain 

'Pm^'Pn - <P^K = ~p {E m - E n )<p n <p m . 

. (71) 


This equation is multiplied by dr and integrated over all space. The 
integral on the left is transformed into a surface integral over an 
infinitely distant boundary by means of Green’s Theorem (68). Since 
the proper functions vanish exponentially at infinity, this integral 
is zero, and the orthogonality is demonstrated. 

On account of the linearity of the wave equation any linear com- 
bination 

T ■= c 0 Y 0 + ^ + . . . c*TY - c 0 ip 0 e 2,riE ‘ lht + + . . . (72) 

is also a solution. Schrodinger interpreted this as an atomic state in 
which several natural frequencies are simultaneously developed, the 
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amplitude factors c k being a measure of the excitation. If we use 
this solution to obtain the expression 'FT* we obtain 

Y'F = c 0 Vo 2 + CiVi 2 +• • • Ck<}>h + ZcoCi'Po’Pi cos ~(E 1 —E 0 )t + ... 

+ 2c*c,^, cos ^ (E t -E k )t (73) 

Thus the density of charge is composed of a part which is constant in 
time and another part whose magnitude oscillates with the frequencies 



This variable charge, together with the nucleus, represents a variable 
dipole moment which, in accord with the laws of electrodynamics, 
emits light of frequencies v kl . When averaged in time, the contributions 
of the variable terms to the charge density vanish, and since the total 
charge must be constant, the amplitude factors must satisfy the 
auxiliary condition 

Ec* 2 = 1 (75) 

The dipole moment, from which the emission may be calculated accord- 
ing to p. 341, becomes 

j>ki — — 2 c k c t e J rifi h il>idr t .... (76) 

where r is the radius vector drawn from the nucleus to the volume 
element dr. The second Bohr postulate, by means of which, the 
quantity h was introduced a second time without justification, is 
thus fulfilled automatically, on account of (74), in the Schrodinger 
formulation. 

Although the Schrodinger interpretation must be replaced nowa- 
days by a statistical view, it gives, nevertheless, a very vivid derivation 
of the dipole components which determine the intensity of spectral 
lines, viz. 

^ = efq^ x dr (76') 

These components are identical with the “matrix elements” which 
determine the relative intensities of spectral lines in a quite different 
theory developed by Heisenberg.* In particular, the matrix theory 
gives the old selection principles correctly. 

The Schrodinger interpretation of the T function met unsur- 

* W. Heisenberg, Zeitschr. f. Phys., 33, p. 879 (1925). That the wave and matrix 
theories are mathematically equivalent was shown independently by Eckart and by 
SchrCdinger. 

24 


(B711) 
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mountable difficulties when an attempt was made to extend the reason- 
ing to several electrons. In this case ip depends upon 3 N co-ordinates, 
and the quantity <p is determined, not as a function of position in actual 
space, but in the 3iV-dimensional configuration space. In place of 
the Schrodinger interpretation we then have a statistical one; V F'F is 
a measure of the probability of finding an electron in the volume 
element dr, or — for more than one electron — the probability that the 
electrons lie in the 32V-dimensional volume element of configuration 
space. We can then average over a large number of atoms and obtain 
a mean distribution of charge which corresponds to the Schrodinger 
assumption. The chief difference between the original formulation 
and this new one due to M. Bom is, however, that, quite in keeping 
with the principles developed in § 2 (p. 690), we continue to think of 
the electrons as points and bring in the waves to take care of the inde- 
terminacy. But since observation always gives an average over many 
atoms and fairly long times, we may in many cases treat the situation 
as though we were really dealing with the instantaneous distribution 
of charge for a single atom. The representations of charge density 
given by many recent papers are to be understood in this sense. 

There is another difference in the two formulations that refers to 
the radiation process. According to (76), it is necessary that several 
proper functions be excited in order to attain a variable dipole moment. 
Translated into the language of statistics this means that in order to 
have emission, not all atoms can be in the same excited state. In 
reality the emission of radiation demands only that the excited state 
be occupied, and (76') then gives the transition probability, whioh is 
independent of the occupation of the lower energy states. 

The continuous X-ray spectrum is also explained in this way by 
transitions between the closely spaced states for which E > 0 (cf. 
p. 702). The largest jump yields a limit frequency v L = EJh, where E 0 
is the initial energy of the electron. 



CHAPTER XLI 


Atomic Problems first solved by the Wave Mechanics 

1. The Theory of Perturbations in Wave Mechanics. 

The wave mechanics not only yields the results of the Bohr theory 
— sometimes in an improved form, e.g. in the rotational term in band 
spectra — but succeeds in indicating the solutions of a number of 
important problems in face of which the Bohr theory was powerless. 
Of these we may mention the problem of several electrons (especially 
the helium problem), dispersion, non-polar chemical binding and, 
finally, the problem of nuclear disintegration (radioactivity). The 
concepts of wave mechanics throw new light on these questions. 

A rigorous solution of the differential equation of the problem of 
n bodies is no more possible in wave mechanics than in celestial 
mechanics. In the former we must also apply methods of approxi- 
mation, starting, just as in celestial mechanics, from the simple Kepler 
problem. The mathematical results needed for this perturbation theory 
will be developed in this section. 

Let us add a perturbing term to the potential energy in the wave 
equation. For an atom, for example, this would be a series in descend- 
ing powers of r, corresponding to a core potential. We therefore set 

U=U° + V’, (1) 

where U° represents the potential energy of the unperturbed system. 
If U' is small, the proper values of the perturbed system will differ 
but little from those of the normal system, and we may therefore 
take 

= E k ° + ** (2) 

In the same way, we can relate the corresponding proper function to 
that of the normal system by setting 

lA = ^*° + W (3) 

We then have for the Schrodinger equation of the perturbed system 
+ A«A*' + (E k ° +e*-I7°- U') (W + **') = 0. (4) 
Neglecting the products of perturbing terms and making use of the 
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fact that tp k ° satisfies the Schrodinger equation whose parameter is 
E k , we obtain the following equation for 0': 

w* + {Ek ° ~ V0) w = ~ nsr 5 (e * “ u ' ] +* 0 ’ (5) 

The left member of this non-homogeneous differential equation is 
identical with that of the equation for the normal system; hence for 
vanishing perturbation this leads both to the proper value AY 
and also to the proper function i/Y, as it must do. Now the theory 
of differential equations shows that a necessary condition for the 
existence of a solution of the non-homogeneous equation is the ortho- 
gonality of the perturbation function on the right to the solution of 
the homogeneous equation. This is proved by Green’s Theorem, as at 
(67) and (68) (p. 704). Hence 

/ ( e k — U') (</>*°) 2 dr — 0 (6) 

But this is an equation determining the change in the proper value 
caused by the perturbation, which is what interests us primarily. Since 
i/> t ° is normalised, this yields 

eic =fU'(W)*dr. ...... (7) 

This result corresponds to the theorem in celestial mechanics which 
states that, to a first approximation, the value of the perturbational 
energy is equal to the perturbation function averaged over the normal 
orbit. 

If the perturbations of the proper values are known, the perturbing 
terms of the proper functions also may be determined by a develop- 
ment analogous to Fourier’s series. That is, under the very general 
continuity conditions — which are always fulfilled in a physical problem 
— any function may be developed in a series of proper functions. If, 
for example, we set 

fix, y. z) = + a^i 2 + . . . «*</.* + (8) 

we obtain (cf. p. 189) the coefficients exactly as for Fourier series by 
multiplying both members by i p m and integrating over the entire 
region. Because of the orthogonality, all terms with subscript different 
from m vanish, and there remains 

J f(x. y, z)>p m (It = a m Jtpm 2 dr = a m . ... (9) 

Accordingly, we develop the right side of equation (5) according to 
proper functions 

o, . . . ( io) 
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where a m — j (O' — e k ) tp k °tji m 0 dr 4 = 0 for k 4= m, 

= 0 for k= m. 

Similarly, we suppose the function <A' to be written as a series of the 
type: 

<A'*=2 (12) 



These series are substituted in equation (5). Then, since the following 
relation holds for 


A<A m ° 


8 ■n 2 m 0 


U°ip m ° 


8^0 


comparison of the coefficients of each \fi m on both sides of the equation 
yields 


b m 


h 2 a m 
8 t -X (E k ° - E m °)‘ 


• (13) 


Thus the condition of orthogonality, (6) and (11), prevents the occur- 
rence of terms with vanishing denominator. 

A complication arises here, as in celestial mechanics, in the very 
frequently occurring case in which the unperturbed system is de- 
generate, i.e. when several proper functions correspond to one and 
the same proper value. We say that the system has a degree of 
degeneracy (a — 1) if a linearly independent proper functions corre- 
spond to E k °. Every linear combination of these proper functions also 
represents a solution of the normal system. As may be seen from the 
derivation on p. 699, the proper functions corresponding to the same 
proper value are not, in general, orthogonal. However, by means of 
a linear substitution we can always set up linear combinations of the 
original proper functions in such a way that the proper functions 
corresponding to a single proper value also have the property of 
orthogonality; in fact, we still have a number of constants whose 
values we must determine when the perturbing force is applied. 
If we put 

-AT 1 = a^-A * 1 + + a* 18 -A * 8 +••• + ] 

<A* 8 = a* 21 'A* 1 + a * 22 *A* 2 + a * 28, Afc 3 + • • • + «* 2a <A* a i, (14) 
*A** = a k al (/I * 1 -|- a k a 2 ip k 2 -(- a*" 3 <A* 3 + . . . + a k aa ifi k * J 

we have a 2 coefficients, while the number of equations to be satisfied 
is a a(a — l)/2, viz, a equations involving J (tfi^fdr and a (a — l)/2 
equations involving jtp^tfi^dr. Thus we have still a (a — l)/2 co- 
efficients free. But this is the same as the number of independent 
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coefficients of an orthogonal linear substitution, i.e. a substitution 
whose coefficients b k ih satisfy the equations 

for A *11 

-lfori-ij (15) 


This transformation applied to the co-ordinates of a point signifies 
the rotation of a set of three rectangular axes. Actually, the ortho- 
gonal property of the proper functions is not disturbed by making 
such a substitution, as may be verified at once. We make the a priori 
assumption that the a initial functions ipk satisfy the condition of 
orthogonality. If the degeneracy is removed by the perturbation, i.e. 
if the single proper value of multiplicity a is split into a different 
proper values E 0 -j- e<, it is still undetermined which of the possible 
linear combinations of proper functions yield the proper functions of 
the perturbed system. We take this into account by for ming the 
following linear combinations of the functions i/'* 1 , </>*“, which are 

already assumed to be orthogonal: 

i/,* 1 * = + VW + • • • 

= V 1 #? + VW + . . . 

t/i t °* = b k al ifrk 1 -{- 6j. a2 tftk 2 .4- - • • biP^ifi/P 



The coefficients are to satisfy the condition (15). 

According to the theory of differential equations it is necessary, 
in order that a solution exist for the perturbed system, that the right 
side of equation (5) (p. 708) be orthogonal to all of the linearly inde- 
pendent solutions of the unperturbed system. If we insert a linear 
combination ipk* in the perturbation term, we obtain from the 
equations 

./??(«*- DW*- 0 _ (1?) 

f — U')'l>k*dr = 0 j 

the a proper values E e k as well as the a initial functions (approxi- 
mations of order zero). To show this we write the abbreviation 

/ U'fi*Wdr= = e* M , .... (18) 

and substitute the values of tpf from (16) in (17). Then, since V 
is assumed known as a function of position and since the functions 
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t/i k h are assumed known, we obtain a system of equations for the 
coefficients b k ih , which we may look upon as unknowns. The system 

VW 1 - €*) + + . . . = 0 

bk 11 e k 21 -h —€»)+... 6* la e* 2a = 0 


+ . . . VfoT -«*)=0 

corresponds to Since the equations are linear and homogeneous, 

the existence of a solution differing from zero requires that the de- 
terminant of the coefficients should vanish. This yields the so-called 
“ secular equation ” 



(«***- «*) 

•• «»«“ 

= 0 . 

. (20) 


e k° 2 ... 






This equation is of degree a and so has a roots. Because of the sym- 
metry of the coefficients all roots are real. We again encounter this 
equation for each solution i/i*’ - *. Substitution of one of the roots e k T 
yields the coefficients b k n after solving the system of equations. This 
then gives the zero order approximation to the perturbed proper 
value, from which the proper function corresponding to the value 
B k ° + e k r develops when the perturbation is applied to the system. 
If the secular equation has several equal roots, this means that the 
degeneracy has not been completely removed by the perturbation. 
Then, without the addition of further perturbations, each linear com- 
bination of the proper functions associated with the multiple roots 
again represents a solution. The new proper functions thus found form 
an orthogonal set because of the symmetry of the coefficient of the 
secular equation. This represents the multidimensional extension of 
the fact that the axes of the second-degree surfaces are mutually per- 
pendicular (cf. Ex. 18, p. 37). 

The perturbation theory in wave mechanics is confined to perturb- 
ing forces which have a potential. It is necessary to extend the 
Schrodinger equation in order to be able to compute the proper values 
for magnetic perturbations, i.e. those not derivable from a potential. 
Even in this instance we are able to derive only the normal Zeeman 
effect. Only by a very complicated generalization of the wave equation 
was it found possible to obtain the additional terms corresponding to 
electron spin in the theory based on models. This solution was given 
by Dirao, but cannot be discussed in detail here, 
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2. The Chemical Bond. Formation of the Hydrogen Molecule. 

As an example of the application of quantum perturbation theory 
we deduce the solution of one of the most important problems in atomic 
physics — the nature of the chemical bond. There is little doubt re- 
garding the nature of the forces holding together polar compounds, 
which are composed of positive and negative ions, as exemplified by 
salts. The forces in this instance are those of electrostatic attraction. 
A more difficult question is that concerning the cause of the limi- 
tation of the attractive forces, so that the oppositely charged ions 
remain at a definite distance from each other — in other words, what 
is the source of the repulsive forces which appear when the ions are 
brought close together? The wave mechanics offers information on 
this point. If we imagine the positive ions replaced by protons, they 
will, of course, be attracted by the negative ions, provided that the 
distance apart is not too small. Closer approach will ultimately cause 
the proton to enter the electron “ cloud ” surrounding the negative 
ion. The ions of a simple salt possess closed electron shells of the rare 
gas type, so that the electrons are symmetrically disposed. According 
to a fundamental law of the theory of the potential, only that part 
of the electron cloud which is within the surface of the sphere upon 
which the proton lies exerts any influence on the proton. Thus the 
proton will soon come to a point where the repulsion of the nuclear 
charge preponderates. Since, by p. 702, the density of charge de- 
creases exponentially with distance from the centre, the repulsive 
forces must increase in this manner. Thus the simple law of force 
containing a single exponent is valid only at certain distances, and is 
merely a makeshift for the true law of force. This is manifested by 
the variations in the empirical value of the exponent of the repulsive 
force. 

A much more difficult question to answer is that concerning the 
nature of the forces holding two atoms of the same kind together. The 
simplest instance of such a non-polar bond is given by H 2 ; a more 
complicated instance is the bond between the carbon atoms in diamond. 
A method of explanation was first indicated by the wave mechanics 
through the introduction of exchange forces or, better, exchange energies. 
Heisenberg was led to this concept when computing the large difference 
in energy between the triplet and singlet terms of helium which — 
according to atom models — differ only in having the spins of the two 
electrons parallel or antiparallel. The small magnetic energy of two 
magnetons is not sufficient to account for the difference. If the elec- 
trons are distinguishable from each other, a given solution will always 
be accompanied by a second one belonging to the same characteristic 
value and in which the two electrons are interchanged. In this sense 
the system is degenerate and each linear combination also constitutes 
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a solution. Here again, as in § 1, we must seek those combinations 
from which the new characteristic values develop by the addition of a 
perturbation. The mutual electrostatic energy of the pair of electrons, 
which was previously neglected, constitutes the perturbation in this 
instance. Proceeding in this way, there results a large difference in the 
perturbations of the characteristic values for the two linear combina- 
tions that correspond to parallel and antiparallel spins. We can 
acquaint ourselves with the method by considering the bond between 
two hydrogen atoms: 

Consider two hydrogen nuclei (protons), a and b, at a large distance 
R apart. Let there be also two electrons. Assume the distance of elec- 
tron 1 from a to be r al — from b, r bl . Electron 2 is at a distance r o2 from 
a, r 62 from b. Call the distance apart of the two electrons r 12 . If the 
nuclei are held fixed, the Schrodinger equation for the system is 


9 V 9 a i/» 9 2 »A 

Sxj 2 dy-, 2 dzj 2 


a 2 «/> 3 2 , p 

dx 2 2 dy^ OZn 2 



f)\ 


( 21 ) 


Assuming that electron 1 belongs to nucleus a, electron 2 to nucleus b, 
the quantities within parentheses are perturbing terms. Omitting 
these terms, we can arrange the equation as follows: 

+ + + ( 22 ) 

where 'E = E 1 + E 2 . 

Each pair of terms set equal to zero represents the Schrodinger equa- 
tion of the hydrogen atom. If, according to equation (60) on p. 702, 
we therefore take the normalized solutions for the hydrogen atom in 
the ground states (w — ' 1, l = 0, m = 0) for the nucleus a as well as for 
the nucleus 6 to be 



then 0= $a(l)^*(2)> (24) 

with E x = E 2 — E 0 , i.e. E = 2 E 0 , certainly is a solution of the equation 
of the combined system from which the interactions have been 
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omitted.* This may be verified at once by substitution in equation 
(22). But we can assume with equal right that electron 1 belongs to 
nucleus b and electron 2 belongs to nucleus a. This leads to a second 
solution whioh corresponds to the same characteristic value 2 E& and 
whioh has the form 

^ = 0«$)&( 1).' (25) 

The original problem — that of two hydrogen atoms assumed to con- 
stitute a single system — is thus singly degenerate; each linear com- 
bination ip 0 ' and if) o" of the two proper functions also represents a 
solution: 


^.(1)^(2) +s* a (2)&(l) r • • * 

Four coefficients are at our disposal. Three of these are deter- 
mined by the condition of orthogonality and the normalization 
to unity: 

JtpQ^dr — J — 1 ; jijj 0 '<p 0 " dr — 0. (27) 

According to the procedure in § 1 (p. 707), the fourth constant and the 
correction term e in the energy are determined by the condition that, 
after inserting the first approximation + 4>i, the right member of 
the resulting non-homogeneous equation must be orthogonal to all 
solutions of the homogeneous equation. Since we have two solutions, 
here, the secular equation is quadratic in e, yielding two roots which 
we designate by e, and e a . The approximation of zero order corre- 
sponding to e„ from whioh the perturbed proper functions are 
developed, results after a lengthy calculation: 

V = + <A,(2)<MD}; . (28) 


♦The value of the multiplicative constant K corresponding to normalization to 
unity is not to be obtained from equation (80), It must be separately computed from 
the relation 

- K- j*” J™ z-tof.rUCldr - 1, 


front which it follows that 
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= {0«(1)^(2)-W2)^(1)}, 


where £ is an abbreviation for the volume integral 

/* 0 (l)fc(2)*.(2)*(l)<*r. 

Thus the proper function which is associated with (, is symmetric in 
both electrons, i.e. it is not altered by interchanging 1 and 2; the' 
other solution — that corre- 
i sponding to e 0 — is antisym- 

15 . I \ metric, i.e. its sign changes 

I \ if the two electrons are inter- 

| I \ changed.- The computation of 

\ t0 . | \ the roots of the secular equa- 
te 1 \ tion is somewhat troublesome, 

§ I \ so that only the results are 

^ 5 — • 1 \ . given here in the form of two 

£ \ \. curves which exhibit the value 

J \ of e as a function of the para- 
's! o — r V — j, meter R (fig. 1). The impor- 

\$h — * tant feature is that the sym- 

metric solution has an energy 
-s L minimum at a definite distance 

Fig. « from the nucleus. This solu- 

tion therefore corresponds to 
the chemical bond, while the other solution, for which the additional 
energy is always positive, corresponds to elastic rebound. 

The wave mechanics was first applied in this manner by Heitler 
and London.* It gives a surprisingly accurate value of the heat of 
reaction in this case, considering that only the zero order approxi- 
mation was computed. But in addition, this method is capable of yield- 
ing information on the possibility of forming certain compounds if 
we translate the Pauli Principle into the language of wave mechanics. 
According to this law, no two electrons in an atom can agree in all 
four quantum numbers. In wave mechanics, this means that the </i 
function cannot be symmetric in the co-ordinates of two electrons. 
This might cause one to believe that the symmetric solution for the 
hydrogen molecule is excluded. It must be remembered, however, 
that file Pauli Principle includes the spin orientation (parallel or 
antiparallel to a given direction) as a fourth co-ordinate, while up to 
this point the electron spin has not been included in the wave me- 


Q. Heitler and F. London, Zeitachr. /. Phya., 44, p. 466 
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chanics. According to Pauli’s rule, two electrons having opposite 
spins may agree in the remaining quantum numbers. Thus, taking 
into account the doubling of the possibilities resulting from the spin, 
we can adopt Pauli’s Principle by saying that the ift function of an 
atom or molecule can at most be symmetric in certain pairs of electrons, 
i.e. it may be such that when electron x and electron y are interchanged 
the function remains unaltered, but in that case not when x and electron 
z are interchanged. The electrons in which the ift function is symmetric 
have opposite directions of spin. For the helium atom, for example, 
the proper function for the ground state is symmetric in both 
electrons, corresponding to the equal quantum numbers n, l and tn t . 
The symmetric function for two atoms would then be symmetric in 
four electrons, which is impossible according to Pauli’s Principle. 
But the solution corresponds to that of the chemical bond even in 
this case. The symmetric proper function resulting from the com- 
bination of two electrons which, in the individual atoms, have no partners 
whose three quantum numbers are the same, but which have oppositely 
oriented spins, represents the interpretation of chemical valence on the 
wave mechanics. 

3. The Theory of Dispersion and of the Raman Effect in Wave 
Mechanic!. 

We now consider the behaviour of an electron in the rapidly alter- 
nating electric field of a light wave. Here, as in the classical dispersion 
theory, we neglect the magnetic forces. The perturbing potential con- 
tains the time explicitly. The previous form of the wave equation is 
not suitable for treating perturbation problems of this kind, for it 
already contains the characteristic parameter, and hence there is a 
separate differential equation for each state. It is not difficult, how- 
ever, to eliminate the parameter E. It follows from our solution 



T = ./re*"*'* ' 

. . . (30) 

that 

9T 2t ritf™ 

9 1 h ’ ‘ * * 

. . . (31) 

hence 

n A aV 1 

an f> 

. . . (32) 

Substituting this 

in the actual wave equation and remembering that 


9*Y 

A* 

. . . (33) 

we obtain from 

A „, Zm(E-V)dW 

AT w~ w 
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tiie form 


AT — 


8ir*m rm , Arrim 3T A 

__ 0 , 


(34) 


which no longer contains a characteristic parameter. 

Let the electric field of the incident light wave be along the z-axis 
and of amplitude A. The perturbing potential is then 

V = —eAz cos27 rvt (35) 


and the new form of the wave equation is 

Sn^wt . rrn . _ Anim 3 T 

AT p- ( U° — eAz cos2tti^)T p- — = 0, 


* «/• 8/i 9 m Tjnxif imm 3T 
AT — W UY ~~h di = 


i-nhn 


eAz^-t+e-***) T. (36) 


We may substitute the unperturbed Ath proper function T* 0 

for T in the perturbing term on the right.* We then have 


AT_8Jn^°T_ 


47rim 3T 
— Tt 


- ~ 1 eAzie*'*'*** + . (37) 


It is natural to take as a solution the sum of the normal solution and 
two other terms of frequency v k 4- v and v t — v respectively, i.e. 

T = T*° + ^ + e*«**+* + . . (38) 

Substituting this in equation (37) and noticing that T*° causes the 
left member to vanish, we obtain two differential equations — one for 
<fi + , one for </»_ — which we may write as the single equation 

A<£ ± + W ± = - ^ eAz+f. (39) 

This equation is again of the form of the perturbed Schrodinger 
equation which was treated in § 1 (p. 707). It differs from the former 
equation, however, by the presence of the term +Av. This has an 
important consequence: in the case at hand, the homogeneous equation 
has no solution whatsoever. The parameter is E k + hv, and a solution 
exists only for the value E k . The theory of differential equations shows 
that the rum-homogeneous equation always has a solution in this case. 
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The condition of orthogonality which must be satisfied when the 
homogeneous equation has a solution, in order to prevent the ap- 
pearance of the term with vanishing denominator E k — E k , is absent 
here. As for the rest, we can then proceed according to the method 
developed in § 1: we expand the function zip k ° of the right member 
in a series involving the proper functions 

#*° = 2a n * </,„<>, (40) 

a nk =+f^k°^ n °dr (41) 

and write i p + and «/>_ also as series of the same type: 

— 2 6 ±t ,</r n ° (42) 

This is substituted in (39). Remembering that satisfies the equation 
C®n — V°) t/j„° — 0, ... (43) 

comparison of coefficients yields 

; • • • (44 » 


Thus we can write as the complete solution 


I e2«(»J k+v)i p z<,i(v k -v)t s 

>r,_ + + '«) 


By using our working hypothesis that e x F v F gives the density of charge, 
we can obtain the electric moment of the atom from this equation. 
The polarization is concerned with the z-component: 

P t = ej z TTdr. ...... (48) 

Substituting the value resulting from equation (45), and using the 
relation 

f Z'frn'I'kd’T — a nk , 

we obtain the following expression for the temporally variable part 
of the moment (the constant part is of no further interest): 

P,{t) = 2e*A cos . (47) 

Thus the temporally variable part has the same frequency as the 
incident wave. Since the ratio of the amplitude of the electric moment 
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to that of the electric field strength is the polarizability a, this quantity 
is thus determined. 

If this value is substituted in equation (3') (p. 452), the value 
obtained for the index of refraction in the state k is 


n — 1 = 4tirNe 2 Ha ni 

— '%nN y So, 
A 


E n - E k 

(E n ~ E h f - AM 

S Vnk 

* V„* 2 - • • 


(48) 


If we notice, in addition, that the spectral lines of the atom are given 
by the frequencies v kn , we see that we have obtained the characteristic 
denominator of the classical theory of dispersion by means 

of quantum mechanics. Thus, apart from the fact that the constants 
in the numerator have a different meaning, we have here the con- 
necting link with the old dispersion formula of p. 452. 

In the language of the Bohr theory, the quantity a nk is a measure 
of the probability of a transition connected with radiation of fre- 
quency v nk for a spatial distribution of radiant energy as given by an 
oscillator vibrating parallel to the z-axis. Thus a„ k is a measure of 
the strength of the classical equivalent oscillator. Comparing the 
constants of the classical and wave mechanics dispersion formulae we 
obtain 


/nfc = 


877 hna nk 2 v„ k 

h - 


(48') 


The terms having the negative sign were not included in the classical 
formula. These appear if E n < E k . For this it is necessary for the sub- 
stance examined to be in the excited state, while dispersion measure- 
ments are usually made on substances in the ground state. Using 
excited hydrogen atoms, Ladenburg succeeded in establishing the 
existence of the negative dispersion terms. 

The remaining components of the induced electric moment are of 
no consequence for the dispersion, but determine — together with the 
z-component — the scattering power of the atom with respect to the 
incident radiation. We shall not consider the general theory of the 
scattering of light by individual atdlns, but shall treat the particularly 
important special case of the Raman effect. 

If several proper functions of a system are active, i.e. if the 
system is in a state of transition, combination frequencies will be 
observed in the scattered radiation. This effect was predicted theoreti- 
cally by Smekal, and found independently by experiment by Raman 
and by Landsberg and Mandelstam at about the same time. Let the 
normal system contain the frequencies v k and v,. Then, by means 
of the same perturbation theory as above, the solution for the 
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[Chap. 
is incident on the 


Y = We*™# + tfife 2 ™? 
- W'VAa* [=■ 


E n -f- hv Ejg — E n — hv j 




f e 2W (-i+-)» 
\Ei — E n + hv 


+ 


e 2 « <'!—>» ) 

Et-E.-hv j 


• m 


The conjugate solution is 

Y = «-*«►*« ^,o e - Mr t t 


B* — 2?„ + A* 

e-2^,+0* e-2'*-/— 0« 


+ T kv + Mt ^ En _ *-;} 


c g—2w«(pi-f r)l toO’j— r)t A 

+ +b + <49-) 

The scattering is determined by the moments 

M m =f x^Vfdr, M v = f y'V'Vdr, M,=* ft zYYdr. (60) 


It is immediately evident that, in forming these expressions, terms of 
frequency v — v u and v -+- are obtained, i.e. the effect of the 
scattering is the same as if a quantum of the characteristic frequency 
v u were added to or subtracted from the quantum hv of the incident 
light. The constants appearing when the moment is determined are 
of great importance. If, for example, we compute M v , and use, in 
addition to 

a»*=/s«An<M n (51) 

the abbreviation 

Kk=fy^n<Pkdr, (62) 

then the term containing v + v u has coefficients which contain the 
products b nk a nl . This implies that the intensity of a Raman scattered 
ray is not determined by the intensity of the light of frequency v lk 
but by the intensities of lines resulting from transitions from higher 
levels n to the levels k and l. Thus is explained the fact that fines 
which are absent from the infra-red spectrum of a molecule on account 
of the vanishing of a, k can nevertheless give rise to intensive Raman 
lines. 


* On aooonnt of the nnellnees of the perturbation we oan neglect all products of 
perturbing terms. 
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The theory developed here is still incomplete to the extent that it 
predicts equal intensity for the two lines v + v ik and v — v lk , and 
connects the appearance of Raman lines with excitation of the mole- 
cule. Actually, the “ Stokes ” line v — v Xk can appear even at absolute 
zero, while the “anti-Stokes” line v + v u only sets in upon excitation, 
i.e. at higher temperatures.* This is readily understood from the stand- 
point of quanta, for at absolute zero a quantum of radiation can transmit 
energy to the scattering molecule but can obtain no energy from it. 

The questions of intensity and of polarization have been solved by 
a more refined theory of the effect. It is possible to gain an insight into 
the nature of what is going on by means of the following considerations 
which are in the spirit of the actual calculation: In questions dealing 
with dispersion, the classical concepts generally prove to be far-reaoh- 
ing in their validity. Classically, the scattering of light arises from the 
forced vibration of the electron shells, with consequent radiation. If 
the polarizability of, say, a diatomic molecule changes with the nuclear 
separation, then the amplitude of the radiation will vary with the same 
frequency as the nuclear vibration; but a spectroscope analyses this 
modulated wave into its component frequencies, as detailed on p. 58. 

L The Surmounting of Potential Barriers in the Wave Mechanics. 

The phenomenon of radioactive disintegration, which will be con- 
sidered in Chapter XLII (p. 739), presents us with the following problem. 

By experiments on a-particle scattering it has been shown that 
for Uranium 1 the Coulomb field holds down to a distance of 3 X 10~ 12 
cm. Apart from a nuclear charge which is two units less, the same 
potential curve is valid also for the element UX 1 which results by the 
ejection of an a-particle. For a closer approach, the potential curve 
must finally bend downward, for in the U 1 atom the a-particle is 
generally bound, and hence must be at a minimum of the potential 
curve (fig. 2). Again, an a-particle ejected by U1 has an amount of 
energy which corresponds to its being released from rest at a distance 
jq == 6 10 -ia cm. and being allowed to accelerate in the field of the 

nucleus. The question arises: How does the a-particle cross the 
potential barrier between its normal position and the point q ? There 
is no possibility of explaining this fact on the basis of classical me- 
chanics. Gamow and, independently, Condon first recognized that in 
wave mechanics it is possible to encounter a particle where it does 
not belong on. the basis of its classical energy — i.e. every potential 
barrier is surmountable. Indeed, according to the statistical inter- 

* This terminology is based on Stokes’ law of fluorescence, according to which 
fluorescent radiation is always of longer wave-length than the exciting light. The 
law follows at once from a consideration of absorption and emission in terms of an 
energy -level diagram. If the atom or molecule is not in an excited state to begin with, 
the energy of the re-radiated photon can never be greater than that of the one that was 
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pretation of the Schrodinger <p function there is a definite probability 
that the electron of the hydrogen atom will be found far from the 
nucleus at a place where it could not be expected at all on the basis 
of its energy (proper value) parameter. 



We shall consider the surmounting of the potential barrier for the 
simple model investigated by Gamow, since we are interested only in 
the essential features here. Assume the potential energy of a particle 
to have a constant value tJ from x — 0 to * =- 1 and to vanish in the 
remainder of the region. Now allow a pencil of a-particles (or, equally 
well, of electrons) to pass from left to right. According to classical 
mechanics, all particles whose kinetic energy is smaller than U will 
be thrown back, while all particle# possessing greater energy will pass 
the barrier. According to the quantum mechanics; neither of these 
statements is exactly true. We limit our present considerations to 
the determination of the number of particles with smaller energy which 
succeed in crossing the ob- 
stacle. To this end we must 
divide the space into three 
regions x < 0, 0 < x < l, 

x > l, and assume a Schro- 
dinger wave for each. The 
boundary conditions must Fig. j 

also be taken into account. 

The situation is simplest in the third region; there we certainly 
have but one wave travelling to the right, while partial reflec- 
tions are to be expected at the boundaries, so that we must have 
waves moving in both directions in the other two regions (cf. fig. 3). 
The Schrodinger equation in Regions I and III is 



d 2 <p 8n 2 M 
dx* + h a 


= 0 , 


(53) 
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-S+Tr ( *- |W " 0 (63 '> 

Since we are concerned only with the ratios of the amplitudes, we set 
the amplitude in Region III equal to unity, and thus obtain the three 
solutions for the three regions: 

fa a x e + b x e- ik ' )> 1 

iP t = a#** + b#-** , .... ( 64 ) 

Pi — e - **** J 

where 

Jc q = ^V2ME, Jc 2 = ~V2M(E-U). 

The solutions must merge continuously at the boundaries in such 
way that at each boundary the number of arriving particles must, on 
the average, be equal to the sum of the number reflected and the 
number transmitted. This signifies the continuity of the ip function. 
On account of the existence of the Schrodinger equation the first 
derivative must be continuous also, otherwise Kip and hence ip would 
be infinite at the boundaries. These conditions yield the following 
equations at x — l : . 

| = -V*.' + b (55) 


Solved for c a and b 2 these become 


a 2 


b , 


*0 - 


Similarly, at x — 0 we obtain 


( 56 ) 
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We are primarily interested in b lt i.e. the amplitude of the incident 
wave relative to that of the wave traversing Region III. Substituting 
the values of o a and b 2 from (55) in (56), there results 

• • (57) 

The intensity of the wave is obtained by multiplying the amplitude by 
its complex conjugate value. For E > U this yields 

+ *[ , -(D1[ 1 -(l)]“ s2y <58) 

According to classical mechanics, all particles would reach Region III 
when E > 1 7; the above formula gives the ratio of the number of 
incident particles to the number of particles passing through as given 
by wave mechanics. The case E <U is, however, of greater impor- 
tance. In this case we take 

k t =ik=~V2M(U~^E) . . . (59) 

and obtain 

»A-J-i(|-0‘ + *(|+iy»,hai. (60, 

Since cosh x behaves like e x when x is large, we arrive at the essential 
feature of the connexion between energy and radioactive constant 
in spite of the simplicity of the model used. The intensity of the inci- 
.dent wave, referred to that of the wave in Region III, increases ex- 
ponentially with the difference V — E, or — reversing the statement — 
the ratio of the number of particles which succeed in crossing the 
barrier to the number arriving there decreases exponentially with 
increasing U — E. 

The result has an importance reaching far beyond the special 
problem from which it has been derived here. The overcoming of 
obstacles in quantum mechanics which appear insurmountable in 
ordinary mechanics is important in considering such problems as the 
escape of electrons from metallic surfaces and the kinetics of chemical 
reactions, but these questions cannot be entered into in this work. 
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5. Energy Bands of Electrons in Metals. 

(a) General Survey of the Problem 

The concept of an electron gas moving freely among the ions of the 
lattice, when used in conjunction with the Fermi statistics, is able to 
account correctly for many features of the properties of metals (see 
p. 602). As mentioned there, however, further development of the 
theory must take account of the interaction of the electrons with the 
lattice ; we can no more neglect the large electrostatic forces here than 
we could in dealing with electrolytes. In this way only we arrive at an 
understanding of phenomena which were formerly not even quali- 
tatively explained, e.g. the reversed sign of the Hall effect occurring 
for some metals (cf. p. 733). Further, such a theory would be expected 
to furnish a criterion as to whether or not a given lattice made up of 
atoms of a specified kind will possess metallic properties. These ques- 



tions are .actually answered by incorporating the Fermi statistics into 
a wave-mechanical treatment of the motion of electrons in the periodic 
potential field of the lattice. Because of the singular behaviour of 
particles with respect to potential barriers (cf. § 4), this motion differs 
from that to be expected on the basis of particle mechanics. This is 
most easily seen in the one-dimensional case, to which we restrict our 
treatment. This calculation involves less work than the one in three 
dimensions, but qualitatively — and that is all that concerns us for the 
present — it yields the same result. 

Suppose an electron moving in the direction of the positive ai-axis 
(fig. 4) to be momentarily in the potential trough at P. According to 
quantum mechanics it can surmount the next crest even if its kinetic 
energy is less than the amount corresponding to the crest. On the 
other hand, even in instances where it has sufficient energy, there 
exists a given — although slight — possibility of reflection, as mentioned 
above. If the de Broglie waves reflected from the several crests agree 
in phase, the result may be a reflection of such magnitude that the 
motion to the right becomes impossible. This is nothing more nor less 
than the reflection in depth for X-rays, which was treated in detail on 
p. 388. Precisely this circumstance is the key to an understanding of 
the original questions. 
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(6) Wave- Mechanical Expression for Current Density 
On p. 703 the expression YY was shown to be a measure of the 
density of charge. Now an analogous expression for the current density 
will be deduced. Since a current is a process occurring in time, we 
must start from the Schrodinger equation (p. 717), which depends on 
the time: 


AY- 


8 rt 2 m 

~W 


77Y — 


ivim 3Y A 

37 = °‘ 


. (63) 


According to a theorem ip the theory of functions of a complex 
variable, the conjugate function Y satisfies an equation in which all 
coefficients are replaced by their conjugates, since the real and the 
imaginary parts must separately satisfy the same equation. Hence Y 
satisfies the equation 


AY- 


8 rchn ™ irrim 3 Y _ 
~¥~ UHr + ~T'dt~ 0 ‘ 


. (64) 


Multiplying (63) by Y and (64) by Y and subtracting, we have 

l YY = YAY - YAY = div (Y grad Y - Y grad Y). (65) 

h ot 

The appearance of the gradient of a complex number may seem strange; 
but one may compute exactly as with the gradient of a real quantity, 
since it represents a combination of the gradient of the real part and 
that of i times the imaginary part. 

Since Y has been normalized in such way as to make J YY dr — 1, 
the time rate of change of the charge density p is given by 3(eYY)/3i. 
Also, as for any current, the equation of continuity of the electrical 
current is (of. p. 196) . 

= div i, (66) 

where * is the current density vector. Thus the current density is 
represented in wave-mechanical form as 

i — ( Y grad Y — Y grad Y). . . . (67) 

The total current carried is given by 

1 = ^/(Y grad Y - Y grad Y) dr. . . (68) 
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While this cannot be done for (67), it is possible to put (68) into a 
simpler form. Inasmuch as Y vanishes exponentially at infinity, the 
integral YY dr over an infinite sphere is zero. Hence by equation 
(59), p. 81, 

0 = <^> Y Y dS grad YY dr = -f(¥ grad Y + Y grad Y) dr, 

and so the total current becomes 

HLf 9 ‘» a ' riT - w 

Since we have Y = and Y = ijjer Uat , it is also possible to write 
1 = • • • * (69<) 

If V P = tyefut and if ^ is real, the expression for i vanishes. This may at first 
sight seem surprising, for on p. 698 we used real <{< functions in treating the hydro- 
gen atom. However, we know from its magnetic behaviour that a rotational 
current is present in this case, as required even by the revolving electron of the 
simple atom model. In reality, the existence of a magnetic field compels us to 
carry out the hydrogen calculation using complex 4> functions, which yield a 
current. According to wave mechanics there oan be no current without a mag- 
netic field, since the functions give the mean values, and in the absence of a 
magnetic field all orientations of the orbits occur with equal frequency and so 
cancel on the average. 


(c) Electron in a Periodic Potential Field 

Let us again restrict our consideration to the one-dimensional case, 
which depicts all the essential facts concerning the behaviour of elec- 
trons in a space lattice. The form of the Schrodinger equation that 
does not involve the time is 


d 2 <ft 

dir* 


+ - (E ~ V) tjt — 0, 


(70) 


where V is assumed only to vary periodically in space with an interval 
equal to the lattice constant a, i.e. 


V(x + a) = V(x). 


(71) 


If *fj m (x) is a solution corresponding to the proper value E m> then 
'hni x -[- a) is another solution corresponding to the same characteristic 
value; because if we set x + a = x’ in (70) we get 


8irhn 
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On account of the periodicity of F, this is again identical with (70). 
Since \£e rule out the degenerate case, this is possible only if both 
solutions differ by a constant factor: 

4(x + a)=pf(x). (73) 

In a crystal of some extent, the condition ip must recur after a 
certain distance. Even if we take tie whole crystal to be a unit for 
purposes of calculation, we may imagine it to be succeeded by an 
identical region. In the one-dimensional case, assume this fundamental 
unit region to be 

£ = (?«, (74) 



where G is a very large number. Since the potential is due to the ions 
of the lattice, G also is the number of ions in the unit region. By re- 
peated application of (73), 

0(® + G°) = <A(s) = V e 'P( x )> 
from which it follows that 

p = e 2j,im,G and ip(x + a) = e* nimlGo >P(x), m --- 1,2 G. (75) 

This property is possessed by the function 

tP(x) = e 2mmxlGa u(x), . . . , . (76) 

if u is a periodic function whose period is a. If V is zero, the solution 
is simply 

ifi( x) = e 2nimx,Ga x const (77) 

This represents a plane wave which moves from left to right or from 
right to left, depending on the algebraic sign. The periodic potential . 
field, in conjunction with u(x), results in a spatial modulation of the 
wave (cf. p. 57 and fig. 5 here). 

Introduce k m as an abbreviation for 2w/A. Then 
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= m = 1,2,... . (79) 


According to p. 695, the relation between the wave-length and 
speed of a free electron is 


2 irm e { 


Consequently the energy of the free electron is 

FF F 


E = \m e \i? — 


8 irhn t \ 


(80) 


(81). 


(82) 


Since a free electron may have any speed, E and k are related para- 
bolically. 

As long as the potential is small compared with the electron energy, 
we can start from the solution for free electrons and proceed by the 
established methods of quantum-mechanical perturbation theory, 
considering the periodic potential as a small perturbation. For this 
purpose we develop V in a Fourier series: 

V = '£’°v n e 2!li ” x,a , (83) 


where, according to p. 55, = v n . Further, let V be normalized to 

make the mean value w 0 equal zero. 

The coefficients v„ are to be considered given, and we seek the 


Fourier coefficients, which replace u m in (76): 

= e^u m (x) (84) 

For simplicity, we omit the index m and write 

u =" s"c B e 2n< "* /0 (85) 


When these Fourier series are substituted in (70), the product of 
two series appears in the term Vtfi. If the terms having the same ex- 
ponent are made to vanish, equations determining the c„ are obtained. 
For instance, collecting the terms involving e 2n ' - 4x;a , the linear parts 
of (70) yield a term containing c 4 , but the product Vtft yields a sum of 
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terms whose index sum is 4, e.g. qq, qq, qq. The equations, solved 
for c m are 

C ” ~ £ (& + 2 im/a)* ^ 

P 

As long as the denominator does not vanish, all of the c n , with the 
exception of e 0 , may be allowed to approach zero as v n does. This 
means that the proper functions and proper values of the electrons in 
the weak, periodio field difier but slightly from those in the absence of 
forces. However, it must be noted that the k m now form a discrete, 
yet very dense, set. The situation is different, however, if the de- 
nominator vanishes; then the c„ can no longer be small. Thus for 
k = —vja we have 



where, according to (81), E = n 

The same denominator is obtained for c 0 . In order to investigate 
the character of the result at these places we approach —ir/a from 
neighbouring points by trying to determine Cq and q for k — (—■ nja) + 8, 
the corresponding proper value being E — (iPjcPp) -f e. 

Neglecting all coefficients shown above to be small, we get the 
following equations for c 0 and q: 



These linear, homogeneous equations can be satisfied only by the 
v anishing of the determinant. This leads to an equation for e; v 


€ = ± v ~ lVl + 


'=±J^r+ i 0 <»> 


In the limit, with 8 = 0. 


« = ± I h I- 


• • (90) 
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Thus for k = —it jo, we get two values of E t , of which the smaller is a 
continuation of the E values that correspond to smaller values of k. 
But this means that tb.e parabola mentioned on p. 729 has a break at 
k = —it /a, and the same thing occurs at k = ±2ir/a, Sir /a, &o. Thus 
there are breaks in the spectrum of proper values. These places corre- 
spond exactly to the reflection in depth of the de Broglie waves from 
the row of points, similar to that of X-rays considered on p. 388. For 
k — +mr/a the waves reflected from the several points are in phase, 
so that no wave moving from left to right can arise. In the three- 
dimensional case these breaks occur for those directions of propagation 



and those wave-lengths for which, according to p. 388, reflection takes 
place. The trend of the proper values-is as shown in fig. 6. 

It is important, for what follows, to notice that the curve is hori- 
zontal at the points of discontinuity, as computation shows. 

With increasing perturbation the breaks in the energy spectrum 
become broader, since the jump in E at k — n-nja is conditioned by 
v„. However, for large perturbation, the present point of departure of 
the calculation is no longer the correct one. In that case, one starts 
with the proper functions of the electrons bound to an ion of the lattice. 
The result is that for very strongly bound electrons (K shell) the in- 
fluence of the neighbours ceases to be important. For the outer elec- 
trons the sharp term values of the free atom change to broad bands 
that correspond to the above, except that the breaks no longer occur 
exactly at nirja. These bands are designated according to the terms 
of the free atoms from which they developed. Thus one speaks of the 
4s or of the 3d band of potassium. 
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(d) Relation between Current and Energy Values 
In order to connect the behaviour of electrons in a periodic poten- 
tial field with the conduction properties of matter we shall deduce an 
important relation between the current and the function E(Jc). Despite 
the fact that the proper values constitute a discrete set, they he so 
close together that they may be considered continuous inside a band 
E. If k is assumed to increase by a small amount a, the E k will change 
by an amount /S and the function u k (which has a period a) will change 
by a function v. Thus to k + a there corresponds the proper value 
E k + 1 3 and the proper function e' ( * + *>*(«*. + v), or, putting 

e ih v= X , ( 91 ) 

we have = (tfi k + ■ . (92) 

To determine j8 and x we substitute into the Schrodinger equation. 
Since ifi k satisfies equation (70), we get the following equation for x 
by discarding the products x a an d wjS : 

, ^X + tl (E k -V)x==-fiy4*-2ia d ^. . . (93) 

According to p. 708 this inhomogeneous equation will have a solution 
only if the solution x — '/’k of the homogeneous equation is orthogonal 
to the right side. The situation differs from that on p. 708, however, 
in that >jj k is complex. In this case the condition for orthogonality 
is similar to that on p. 708 except that in place of ifj k we have the 
conjugate value ift k . Then, since j'tp k >f) k dT = 1, 

' — */*£*■ (94) 

By equating (69), p. 727, the current is given by 

■ = e- 4 ***- < 95 > 

Also, in the one-dimensional case, 

grad ^ = 


Then, on account of (82), p. 729, we have the important relation 


p dE k h j 

a dk 2 ne ’ 


(96) 


Ot 


/ 


2 w edE 
Ji dk' 


(97) 
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The quantity dE/dk vanishes at the edges of the bands, as men- 
tioned above. Hence the current, which is carried by electrons having 
the limiting energy, is zero. Physically, this remarkable result means 
that because of the reflection in depth of the de Broglie waves there 
is now a standing wave rather than a progressive one. 

(e) Conduction Properties of Matter; Algebraic Sign 
of the Hall Effect 

We are now in possession of all the essentials for building the wave- 
jmechanical theory of metals. To be added, as in the case of an electron 
gas, is Pauli’s principle, i.e the Fermi Statistics. We are then in a 
position to give qualitative answers to the more important questions. 
Space does not allow a description of the quantitative three-dimensional 
theory, for which the special literature should be consulted.* 

A band contains G closely spaced energy levels. The Pauli principle 
requires that every quantum state bfi only singly occupied, and on 
account of the two possible spin orientations we can .thus accommo- 
date 2 G electrons in each band. There are G atoms in the unit region. 
Without regard to spin, each electron in an atomic configuration is 
characterized by the quantum numbers n, l, m x (cf. p. 679). For each 
trio of numbers, designating a single band, there are two possible 
electrons per atom. This is precisely the case for closed shells. For 
example, this is true for the inner as well as for the outermost shells 
of the alkali halide ions, which are like the rare gases in this respect. 
The alkalis, on the other hand, have a single electron outside the rare- 
gas configuration. But, because of the spin, the Pauli principle allows 
for the presence of two such electrons, so that the band corresponding 
to the valence electron is only half-occupied. 

What influence on conductivity does the extent of occupation of the 
bands have? In an electric field the electron obtains energy and so 
moves up to the higher levels .within a band. In general, however, the 
energy that can be obtained from the field is not enough to force a 
transition to the next band. If the upper levels are already occupied 
(rare gases) a displacement of this kind is not possible, and the material 
is an insulator. The case is otherwise for the alkali atoms, where the 
half-filled state of the bands provides enough room for such displace- 
ments, and so the alkalis in the solid state are typical examples of 
metallic conductors. 

The case of the alkaline earths is more complicated. The ns shell . 
is fully occupied by two electrons. That these elements are neverthe- 
less conductors is explained because the adjoining band lies so close 
to the ns band that the two overlap. The effect of the field is a double 

* See, for example, F. Seitz, The Modern Theory of Solids, McGraw-Hill Book Co., 
1940. 
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one: the number of electrons in the next higher band is increased, 
and vacancies oocur in the ns band. The effect of these vacancies will 
be explained below. Another situation that may arise is that two 
bands are separated, but that the distance between them is so small 
that thermal motion suffices to elevate individual electrons to the 
upper band. Substances where this occurs are the simplest electronic 
semi-conductors. For such materials the conductivity does not de- 
crease with increasing temperatures, as it does for metals; rather, 
corresponding to the increasing number of electrons reaching the next 
band, the conductivity increases exponentially. 



The Hall effect, mentioned above on several occasions, is the fol- 
lowing phenomenon: Let a current flow through a rectangular plate 
in the direction of its longer side (fig. 7). The equipotential lines will 
be perpendicular to the lines of flow. Between two points P x and P 2 
lying on an equipotential there will be no potential difference, and a 
voltmeter joining them will read zero. If we now apply a magnetic 
field directed outward and perpendicular to the page, there will be a 
force — e[vH] on each moving electron and it will be deviated to the 
right.* The accumulation of electrons on the right side gives rise to 
a transverse electric field held in equilibrium by the magnetic force. 
The next effect is that the lines of flow remain parallel to the side of 
the sample, but the points P x and P a no longer are at the same poten- 
tial. For electron*., which are carriers of negative charge, P 2 must be 
negative with respect to P*. This is the case, for instance, for alkalis. 


* Notice that v for an eleotron is directed from 
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For many metals, such as zinc, the sign is reversed, as if positive car- 
riers were taking part in the flow of current. The band theory accounts 
for this readily : It seems fairly obvious — and may be proved strictly — 
that the “ hole ” in an almost fully occupied band acts like a positive 
charge. In an instance where two bands overlap as mentioned above, 
which happens for the elements of the second column of the periodic 
table, the conduction is made up of two parts — thatW the holes and 
that of the electrons of the higher band. If the former preponderates, 
the sign of the Hall effect reverses. 

The next task is to compute the conductivity and its temperature 
dependence on the basis of our picture of electron waves permeating 
the metallic crystal. As long as the lattice is. a perfect one there will 
be no possibility, according to the above treatment, of transferring 
energy to the atoms of the lattice, i.e. a uniform metal has zero resis- 
tance at absolute zero. At higher temperatures, however, the perio- 
dicity is disturbed by thermal motions and the randomly distributed 
departures may be introduced into the computation as a perturbation. 
The calculation is cumbersome but it succeeds in giving a satisfactory 
picture of conduction. 

6. The Role of Lattice Defects in Dielectric Crystals. 

It was stated in the preceding section that a materiakis an insulator 
or a conductor according as the uppermost energy band is completely 
or partially occupied. In rare cases, the nearest unoccupied band may 
lie such a short distance above the highest fully-occupied one that even 
at room temperature occasional electrons may find their way into the 
upper band. Each electron thus elevated contributes in two ways to 
the conduction: firstly, because this electron is itself mobile, and 
secondly, the hole now remaining in the previously fully-occupied band 
is able to move (“ hole conduction ”). Since the number of electrons 
elevated in this way is governed by the Boltzmann distribution, the 
conductivity of such truly electronic semiconductors increases with the 
temperature according to e~ wlkT , where W is the excitation energy. This 
temperature dependency is well founded experimentally. 

With other substances, e.g. boron, the conductivity of different 
samples is often found to vary by several orders of magnitude, even at 
the same temperature. The cause, recognized only relatively recently, 
is the great influence of lattice defects or imperfections. These may be 
atoms of impurities, holes, or atoms of the material itself which occupy 
locations between lattice points. If they are present in concentrations 
as low as even 10~ s they are able to alter the conduction properties 
completely. The lattice defects constitute the , critical factor in the 
technical application of a tremendous number of materials: oxide 
cathodes of electron tubes, dry rectifiers, detectom, transistors, selenium 
photocells, phosphors, and photographic emulsions. Preparation of 
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suitable materials for such uses amounts to finding ways of introducing 
the proper defects into the substance. We shall consider two classes of 
materials here — phosphors and photographic emulsions. 

One of the . oldest and most widely used phosphors is ZnS con- 
taminated with traces of other heavy metals or rare earths. It may be 
fabricated in forms that show long-lasting phosphorescence, nr others 
where the decline is rapid, such as those used to coat cathode-ray and 
television receiving tubes. In the level scheme of such a phosphor there 
is an impurity band C, of finite width, lying higher than the fully- 
occupied band A of the main material (fig. 8). An electron at the 
location of the impurity ion has somewhat higher energy than one in 
the main lattice. In every phosphor there is a second kind of defect 
fulfilling the requirement that the whole crystal be electrically neutral. 
If, for example, the place of a Zn ++ ion is taken by a Cu+, there will 
be an excess of negative charge in the neighbourhood. This may be 
compensated, for instance, by the removal of one S — ion for each two 
Cu+ ions present. Since a vacancy of this kind corresponds to a positive 
charge, it will attract and hold electrons, acting like an electron trap. 
The corresponding energy level D lies just below the conduction band B. 

The mechanism of phosphorescence is then visualized as follows: 
A photon absorbed by the main substance ejects an electron from A 
and sends itr upward to B.* In B the freed electron interacts with the 
lattice elements, losing energy thereby and descending to the lower 
edge of band B. Its return from this point to A, which would violate 
a selection rule, is prevented by the circumstance that its former place 
in A was soon filled by an electron from the impurity level. A lattice 
defect in the latter level is completely surrounded by normal lattice 
elements, and so such a transition can take place readily. 

The conduction electron, whose existence can be directly verified 
electrically, moves through the lattice and is trapped in D. Passage 
from here to a vacant place in C will be possible only if the level D 
happens to lie near an impurity atom in which there is a vacancy. This 
seldom happens, and the electron usually is thrown back into the con- 
duction band by the absorption of thermal energy. This Jaappens 
repeatedly until the electron chances to fall into a properly placed trap 
from which it can return to an available place in an impurity ion. The 
latter transition is accompanied by the emission of radiation, and the 
phosphor is then back in-its original condition. 

This picture of what goes on, while apparently somewhat artificial, 
succeeds in bringing order to a wealth of observational material. For . 
one thing, it explains at once the strong temperature dependence of the 
luminescence which enables the substance to “ freeze ” the absorbed 
light and then give it out again rapidly upon warming. In order to 

* Tbe rarer transitions C to B and C to D oooor also, but will be neglected here, 
since there are many more absorbing atoms in the ground level. 
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pass from an unfavourably placed trap to a better one located near an 
impurity ion that is able to accept an electron, the conduction electron 
must take up heat energy to bring it back to the conduction band. The 
band theory also explains the selective effect of infra-red light whose 
quanta correspond to the energy difference B — D. The mechanism is 
simpler only for certain solid organic phosphors, where the absorption 
and emission processes take place within the same molecule, the excited 
state being metastable. 

There is a connexion between the explanation of the luminescence 
of inorganic phosphors and that .of the latent photographic image. The 
energy level scheme corresponds to that of fig. 8. The effective im- 
purity centres are known to be S — ions, which find their way into the 
silver bromide granules from the gelatin of the emulsion. When the 
necessity of sulphur impurities was finally discovered, many puzzling 
cases of insensitivity in emulsions were cleared up. 



In order to compensate the double charge of each S — ion, there 
must somewhere be a missing Br~ ion. These Br vacancies are the 
electron traps. Compared with phosphors, the new feature in addition 
to the purely electronic process is this: passage of an electron to a trap 
site is followed by a motion of ions, which leads to the formation of 
sensitivity centres. Even at room temperature, the silver halides 
exhibit an appreciable electrolytic conduction, borne mainly by the 
silver ions. This has to do with the smallness of these ions. In the case 
of a photographic emulsion, there is also a shift of the Br- ions to be 
taken into account, and there is a corresponding movement ofBr* 
holes. Such motion can take place if there is a negative space charge 
at some point. This will happen, for example, if a second electron 
should fall into the Br~ bole. Experiments with alkali halides bear this 
out. In such instances, little clusters consisting of Ag+ ions and Br - 
holes containing one or two electrons are formed. Except for the size 
of the lattice constant, such an aggregate is nothing more nor less than 
a tiny crystal of metallic silver, which serves as a sensitivity centre. 
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The decrease in sensitivity at low temperatures is then understandable 
on the ground of reduced ionic mobility. .Further, the quenching of 
the latent image by infra-red light (Herschel effect) is explained ns the 
elevation of the trapped electron to the conduction band. Also, the 
fact that the photographic density is found to depend not only on the 
product of intensity by time but also on the way in which the energy 
is introduced during exposure is explained by the above considerations. 



CHAPTER XLII 


Nuclear Physics 

1. General Remarks on the Physics of the Atomic Nucleus. 

Now that the structure of the electronic shells of the atom and the 
processes going on in them have been revealed in their main outlines, 
the interest of physicists since about 1930 has chiefly been centred 
on the atomic nucleus. Here both experiment and theory indicate 
relationships essentially different from those which apply to the outer 
layers of electrons, and we shall accordingly begin by discussing these 
differences. 

The most important distinctive feature of the physics of the nucleus 
is the immense amount of energy which, as has long been known from 
the facts of radioactivity, is involved in any nuclear phenomenon. 
While the amounts of energy involved in changes in the outermost 
shell of electrons (excitation, ionization, chemical binding) are of the 
order 5 electron volts, the amounts of energy transformed ip nuclear 
processes are over 10® times as great. This energy ratio is even greater 
than we should expect from the dimensions of the nucleus and of 
the outer shells — that is. by comparing the potential energy of two 
elementary charges, first when at a distance apart of the order of the 
diameter of the atom (10 -8 cm.) and again when separated by a dis- 
tance of the order of size of the diameter of the nucleus (10~ 18 to 10~ la 
cm.). Owing to the magnitude of the energy liberated it is possible, 
in contradistinction to the physics of the outer shells, to observe indi- 
vidual phenomena. In this way we are made aware of uncommon 
occurrences, which, though they elude the methods of large-scale 
physics, may have an extremely important bearing on cosmological 
problems, because there the rarity of the event is compensated for by 
the immense periods of time available. These large amounts of energy 
manifest themselves in the form either of fast corpuscular rays (elec- 
trons, protons and a-rays) or of radiation of very short wave-length 
(y-rays). In the case of the short y-rays, however, it must be borne in 
mind that though we speak of waves practically nothing of a wave 
nature is revealed experimentally, all “ wave-length determinations ” 
being really determinations of the magnitude of the light quantum hv. 

Conversely, these energy-rich corpuscular rays and y-rays' may be 

7S9 
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used to bring about changes in the nucleus. We have them at our 
disposal in the rays emitted by the radioactive elements and in the 
cosmic rays. The latter were discovered by V. F. Hess as far back as 
1910 and have since been the subject of an enormous number of ex- 
periments. It may be taken as established that these rays are diffusely 
incident on the earth from without, coming from all directions in 
space, and that they are a mixture of the most diverse types of ray, 
whose energy may reach !0 17 electron volts and perhaps even more. 

In recent years we have found means of giving protons, deuterons 
and a-particles velocities far in excess of those of “ natural ” radia- 
tions (except those of cosmic-ray particles). Thus, in the cyclotron 
a-particles may be given energies amounting to several thousand 
million electron volts, as compared with a-rays from radioactive sub- 
stances, whose energy is never as much as ten million electron volts. 
Further, particularly energetic particles are obtained also from arti- 
ficially radioactive elements (cf. § 4). 

Several methods are available for detecting the individual pheno- 
mena. The simplest of these is the scintillation method, which is very 
suitable for a-particles and hydrogen nuclei or protons. The energy of 
these is so great that when an individual particle hits a screen of zinc 
blende it gives rise to a flash of light which is visible in a microscope. 
Owing to the differing brightness of the flashes it is possible to dis- 
tinguish between a-particles and the feebler protons. The velocity of 
the particles is determined from their range, i.e. the distance from the 
source of radiation, in air at 0° C. and 760 mm. pressure, at which they 
can no longer produce scintillation. The relationship is an empirical 
one; for a-particles the formula 

R = 0-965 X lO- 27 ^ 3 (1) 

where R is the range and v the velocity, is widely applicable. Accord- 
ing to Blackett’s experiments, in which the velocity was obtained from 
the magnetic deviation, the formula 

R = 1-77 x (2) 

is more satisfactory for protons. A formula which has a more accurate 
theoretical basis, but which is very inconvenient in practice, has been 
given by Bethe. Reference may be made to equation (5) on p. 742. 

Still more elegant and direct is the cloud chamber method due to 
C. T. R. Wilson. In this method the paths of the particles are made 
visible owing to the ions formed along these paths being caused to act 
as nuclei for the condensation of supersaturated water vapour. Here 
again the nature of the particles is revealed simply by the intensity 
of the track (the paths of electrons can also be made visible in the 
same way). As before, the velocity is obtained from the range, that is, 
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the length of the track. A better method consists in placing the chamber 
in a magnetic field and measuring the resultant curvature of the track. 
This method is especially useful in revealing the sign of the charge 
carried by the particle. Recently it has been found that a-particles and 
protons leave tracks in a photographic emulsion — a method that is 
valuable for recording rare events without requiring special attention. 

Again, the ions produced by a particle can be detected directly by 
means of specially sensitive electrometers, such as those due to Hoff- 
mann. Or the ions first formed may be made to give rise by collision 
to a large number of new ions, so that less sensitive instruments may 
then be used to detect the total current. This is the principle of the 
Geiger and Geiger-Muller counting apparatus. In both types of instru- 
ment the ions first produced give rise to a gaseous discharge which is 
then rapidly extinguished by a suitable device. Depending upon the 
magnitude of the applied potential, a counter may be made to give a 
current pulse proportional to the number of initially produced ions or 
else each particle may be made to produce a pulse of the same strength.' 
It is necessary to use weakly ionizing particles (electrons) in the break- 
down range. Counting and recording devices may be actuated by 
means of suitable amplifiers. Recently, in place of some of the above 
instruments, scintillation counters have come into use. Here the 
flashes are amplified by photo-electric means and then registered. Suitable 
scintillation materials permit the recording of /3- and y- rays as well. 

For very energetic particles such as those found in cosmic rays 
(§ 15) it is often difficult to distinguish their nature. The charge is 
usually recognized as a multiple z of the electron charge, but the rest 
mass and the velocity (or the kinetic energy) are unknown. Although 
the requirements are usually prohibitive, it may be possible to observe 
the path of the particle in a strong magnetic field directed normal to 
its path. In such case, equation (14) of p. 429 furnishes one relation. 
The product Hp, called the magnetic rigidity, tells us the value of 
mvc/ze. But at the extremely high velocities encountered, the relat- 
ivistic mass increase becomes important, and the value sought is the 
rest mass which may be very much smaller than m. 

Properly speaking, the product Hp gives us a relation between 
m 0 and the kinetic energy E. Thus in equation (14), referred to above, 
m is to be replaced by mjV 1 — v i jcr. From equation (82) on p. 257, 

E =”^(vnb , (3 > 

This may be solved for v. Using the result and also m 0 /-y/( 1 — vP/c 2 ), 

tve find that 

Up = ~ VW+ 2 mjle*. 


W 
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A second relation is found from the energy loss per unit path length 
which the particle experiences by ionization of the air or other material 
through which it travels. Bethe derived a relation for very high energies 
E ;§> eF { , where V t is the complete ionization potential of the atoms 
composing the medium; it may range as high as 10 kv. His equation 
for the energy loss per unit distance is 

The mass of the particle does not appear, so that all particles having 
the same charge and speed will suffer the same loss. The charge number 
of the particle is z, while the atomic number of the atoms of the medium 
is Z and their mean ionization potential is V t . a The number of atoms 
per unit volume of the medium iB N and Q is a correction terra which 
may be neglected at high velocities. 

Inspection of the equation discloses that the energy loss, and hence 
the intensity of ionization, decreases rapidly as the particle speed in- 
creases, approaching a flat minimum. For protons in air, this minimum 
corresponds to energies of about 2300 Mev. The number of droplets 
per unit length in a cloud chamber or the number of developed grains 
per unit length on a photographic plate increases with the number of 
ion pahs, so that the energy loss can be found empirically from such 
observations. The thickness and curvature of the tracks in a magnetic 
field almost always permit of unique identification. In the absence of 
a magnetic field, the energy may be found from the range of the particles. 
Suppose the energy loss has been determined as a function of the dis- 
tance traversed. Numerical integration will then give E as a function 
of the distance x and hence —dEjdx as a function of E. Then the range 
is given by 

i-fe — fjs* 11 - < 6 > 

Since R is observed, this is an equation for the determination of the 
unknown initial energy E 0 . Because of the uncertainty of the energy 
loss measurements, one usually starts from the empirical range-energy 
relation for 15 Mev particles and integrates from this point up to the 
observed range. As a result of the simple dependence on z and on the 
particle mass M, equation (6) is especially suitable for computing the 
range of a particle of mass M' and charge z‘ from the known range of 
a particle of mass M and charge z having the same velocity.* Taking 
the energy, as usual, to be the abscissa, corresponding points will have 
different abscissas as well as different ordinates. (Cf. Ex. 134 opposite.) 

* dEjdx is independent of M, but E, and therefore dE, is proportional to M, 
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In the case of electrons, there are two other mechanisms of energy 
loss in addition to that due to ionization: Compton effect and Brems- 
strahlung (according to even the classical theory, an accelerated electron 
or other charged particle radiates electromagnetic waves — the con- 
tinuous X-ray spectrum). Further, the formula for dE/dx includes 
other terms that can no longer be neglected when dealing with electrons, 
so that the range formula (5) is no longer applicable to electrons. For 
heavier particles the retardations are never so great as to make the 
Bremsstrahlung important. 

So far as the theory of nuclear physics is concerned, the goal we set 
ourselves is the deduction, by application of wave mechanics, of the 
various nuclear states from some sort of model, as was accomplished 
in the case of the electron shells. Matters are still very much in a state 
of flux, but a number of important results have been obtained from 
the principles of conservation of momentum, energy and charge, whose 
validity is assumed to extend to the nucleus. Of quite special impor- 
tance is the law of the inertia of energy (cf. p7258), which takes the place 
of the conservation of matter. It enables us to calculate the release of 
energy in a nuclear reaction from the mass defect. In this connexion 
the following transformation formulas should be noted: 

I Mev = 1-60 X 10"-* erg = 1*78 X 10 -27 gm., . . (7) 
Mass of electron = 5-12 X 10 5 electron volts. . (8) 

Thus if a mass decrease of 10~ 2 3 a.m.u. (p. 765n) should occur in a nuclear 
reaction, energy amounting to about 1 Mev would be liberated. 

Ex. 134. An a-particle (M ="4, s = 2) with an energy of 12 Mev has an 
observed range of 14-42 cm. Find the energy Of a proton (M => 1-008, t — 1) 
which will have this range. 

2. The Hyperflne Structure of Spectral Lines as a Connecting Link 
between the Physics of the Outer Shells and that of the Nucleus. 

In addition td the methods peculiar to nuclear physics which were 
described in the previous section, spectroscopy , which is a method of 
ordinary large-scale physics, has its contributions to make to the 
problems of nuclear structure. One way in which spectra are affected 
by the nucleus depends in the first instance on the joint motion of the 
nucleus and its surrounding shells; this, however, is only of importance 
in the lightest elements. In single-electron systems the motion about 
the common centre of mass can be calculated directly; we have already 
carried out this calculation on p. 652 for the interval between the H 
and He + lines. This effect of the mass of the nucleus has gained fresh 
significance in connexion with the lines of the isotope of hydrogen of 
mass 2 discovered by Urey, Brickwedde, and Murphy, whose nucleus 
is called a deuteron. The lines of the isotope are found by replacing 
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M b in equation (31) (p. 652) by twice its value. The proportion in 
which the two isotopes occur may be deduced from the ratio of the 
intensities of their spectral lines. While ordinary hydrogen contains 
only one part in 5000 of the heavy isotope, it is easy to obtain a richer 
mixture, owing to the magnitude of the relative difference of mass. 
(With other elements, as is well .known, the difficulties of separating 
isotopes in weighable quantity are very great.) One way of carrying out 
the separation, for example, is by the electrolysis of water, in which a 
preponderance of the lighter hydrogen isotope is set free. 

Apart from this effect of the nucleus due purely to its mass, there 
is an electrodynamic interaction which gives rise to a splitting up of 
terms additional to that involved in the systematic theory of spectral 
terms developed on p. 669. This is referred to as hyperfine structure. 
The phenomenon has long been known in the case of the mercury 
lines; owing to the slightness of the splitting these- lines have often 
served as tests for interferometers of high resolving pbwer. 

' If we ascribe an angular momentum of magnitude Ih/2-n to the 
nucleus it is possible to develop a systematic theory of the hyperfine 
structure. The total spin of the atom is then given by F (fine quantum 
number), the resultant of I, the moment of the nucleus, and J (inner 
quantum numSer), the moment of the electronic shells. The spin 
vectors / and J are combined to form the resultant F in the same way 
as L and S are combined to form J, J corresponding to the vector L, 
When J > I a figure the same as that on p. 670 but with a different 
interpretation of the vectors gives a hyperfine structure with (21 -j~ 1) 
sublevels to every term. The Zeeman effect may be treated in the 
same way. It is to be noted, however, that owing to the smallness of 
the splitting in fields of the necessary intensity there is an uncoupling 
of the vectors J and I corresponding to the Paschen-Back effect, 
which gives the Zeeman type of the multiplet component in question 
a hyperfine structure of (27 + 1) components. The mechanical moment 
of the nucleus is accordingly determined by the number of components. 
Very many observations of fine structure have been made and they 
show that atoms of even atomic weight have integral, hut in most cases 
zero, nuclear moment ; whereas in atoms of odd atomic weight the nuclear 
moment is always equal to an odd multiple, of one-half. 

The magnetic moment of the .nucleus, however, is not determined 
by the mechanical moment of momentum, as we do not know whether 
the latter is due to circulating electrons or to nuclei. If we replace m, 
the mass of the electrons, in equation (60) (p. 666) by M„, the mass 
of the proton, the magnitude so obtained is called a nuclear magneton-, 
it is accordingly 1840 times as small as the Bohr magneton. If we 
make an approximate calculation of the term perturbation due to a 
magnetic dipole situated at the nucleus, the very small splitting gives 
the order of magnitude of the magnetic moment as that of a nuclear 
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magneton. Direct measurements, using molecular-beam techniques, 
lead to the same result. This furnishes one of the first objections to the 
earlier idea of nuclear electrons, whose spin would give me to one Bohr 
magneton per electron. 

From the discrepancy between , the observed hyperfine structure 
and the separations to be expected from the multiplet rules, Schiiler 
has drawn far-reaching conclusions about an ellipsoidal distribution 
of charge in the nuclei of many atoms, (electric quadrupole moment). 

In addition to this hyperfine„structure occurring in a single isotope, 
there is another observed in elements which consist of a mixture of 
isotopes; here the lines or groups of lines correspond to the individual 
isotopes. The combination of these two effects make it difficult' to 
analyse the hyperfine structure in elements such as mercury which 
have many isotopes.* Inasmuch as the mass effect (p. 651) of the nucleus 
is least for the heavy elements, this type of hyperfine structure shows 
a difference in the nuclear field which is not that of a point charge. 
In fact,, this isotope effect is greatest for the lowest S terms (e.g. the 
6s), whose electrons approach the nucleus very closely owing to their 
large eccentricity. 

3. Radioactivity. 

Radioactive phenomena are amongst the earliest known facts, of 
atomic physicsr On them the establishment of an experimental basis 
for the model atom essentially depended. As is well known, the heaviest 
elements spontaneously emit rays which consist partly of radiation of 
short wave-length (y-rays), partly of fast electrons (/9-rays), and partly 
of He ++ particles (a-rays), the nature and energy of the rays being 
characteristic of the particular heavy atom in question. It was soon 
recognized that the source of these rays is the nucleus of the atom, so 
that the study of radioactivity represents the first phase of nuclear 
physics. As a-rays and /8-rays carry electric charges, the loss of one of 
these particles alters the charge on the nucleus and a new element is 
formed. Loss of an a-particle diminishes the nuclear charge by two 
units, so that the element moves back two places in the periodic table. 

Loss of an electron raises the positive nuclear charge by one unit, so 
that the new elementds one place ahead of the former element in the 
periodic table. These are the celebrated displacement laws of Fajans 
and Soddy. If we trace out a disintegration series, we keep coming 
back to the same place in the periodic table, but with a different atomic 
weight each time. Scientists were thus forced to recognize the existence 
of isotopes, and these were found also among other elements by the 
methods of mass spectroscopy. 

To describe the course of a transformation process as time goes on, 
we have to start from the experimental fact that the probability of 
any particular atom disintegrating in the following instant is quite 
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independent of external conditions, depending only on the internal 
structure of the nucleus. We accordingly represent this probability 
by a characteristic constant A. fihe probability that a selected nucleus 
will disintegrate during the elementary interval of time dt is then A it, 
and if there are a large number ( N ) of nuclei, the number of nuclei 
disintegrating, in the interval dt is NX dt, by the fundamental principles 
of statistics. We therefore have the differential equation 

dN ass —NXdt (9) 

(the negative sign being necessary, as the number N is decreasing). 

Integrating and taking N 0 as the number of atoms when t — 0, 
we obtain the number of atoms left at the end of a finite time, 

N = N 0 e-“ (10) 

That is, the disintegration constant A is the reciprocal of the time 
required for the number of atoms of the original substance present to 
fall to 1/e of its initial value. In addition the half- value period (T) is 
often given, i.e. the time in which the number of atoms present is 
reduced to half its value. Substituting N 0 /2 for N in equation (10), 
we see that 

r = Jlog e 2 (11> 

As a rule the product of disintegration is itself radioactive. If, as 
in the case of uranium deposits, we have a long-lived parent substance 
whose diminution may be neglected, a state, of equilibrium among the 
disintegration products is ultimately reached, atoms of each type 
being produced and being destroyed at the same rate. If we let the 
suffix 0 refer to the parent substance and the suffix 1 to the first dis- 
integration product, the increase in due to the disintegration of the 
parent substance is 

dN^N^dt, 
whereas the loss by further disintegration is 
dNf = dt. 

In a state of equilibrium we accordingly have. 

IVo-lVA, (12) 



That is, the numbers of atoms of the disintegration products of a long-lived 
'parent substance present when radioactive equilibrium has been reached 
are in the ratio of the half-vatue periods. This relationship serves to 
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determine the half-value periods of very long-lived types of atoms 
from those of other atoms which can be measured directly. 

The fact that in the case of the a-rays there is a definite connexion 
between the probability of disintegration and the energy of the par- 
ticles shot out has long served as a stimulus to theoretical speculations 
on the mechanism of disintegration. For the shorter the life of an 
element, the greater is the energy set free in its disintegration. One 
might be tempted to compare this with the great velocity of strongly 
exothermic chemical reactions; here, however, the phenomena are 
quite different. The high velocity of a chemical reaction arises from 
the fact that the heat developed in the reaction increases the number 
and violence of the atomic collisions, whereas in radioactive transfor- 
mations the relationship mentioned above applies to the individual 
atom. The relationship may be represented in a range extending over 
fifteen powers of ten by the following empirical formula involving two - 
constants which are characteristic of the particular radioactive series: 

log A = A + B log E (Geiger-Nuttall) . . (13) 

log A = a -f- bV E. (Swinne). . . . (14) 

The second form exhibits the rapid (exponential) growth of the prob 
ability of disintegration with the energy. This feature, however, is also 
brought out if we regard disintegration from the quantum mechanics 
point of view as the crossing of a potential barrier,. such as we investi- 
gated in a very simple case on p. 721. For this phenomenon, which 
plays a large part in nuclear physics, we shall in future use the ex- 
pression “ quantum mechanics tunnel effect ”, as the particle, so to 
speak, gets past the potential hill by going through a tunnel. 

Attempts have been made to fin$ analogous formulae for f3 disin- 
tegration, but here, as we shall see later (§ 9, p. 763), the theoretical 
aspects are more complicated. For artificially radioactive nuclei an 
empirical relationship (Sargent’s formula) has been obtained which 
corresponds roughly to equation (13), except that the constant A has 
different values according as the nuclear momentum (nuclear spin) 
is altered by 0, 1 or 2 in the atomic transformation in question. 

4 . Artificial Transformation and Excitation of the Nucleus. 

The cohesion between the constituents of the nucleus is so great 
that we may expect a priori that it will be impossible to bring about 
any changes in the nucleus, unless by using corpuscular rays of very 
high energy. While electrons were ugtil recently thought to have no 
effect, Rutherford succeeded as long ago as 1921 in bringing about a 
nuclear transformation by means of a-rays, and proceeded to investi- 
gate the protons (hydrogen nuclei) liberated from hydrogen by a-rays. 
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For the collision of a-particle and proton the calculation carried 
out on p. 841 must be modified, inasmuch as the mass of the nucleus 
struck is not large compared with that of the a-particle, but is actually 
smaller than the latter. There is no difficulty in calculating the motion 
of the two bodies relative to the common centre of mass. The velocity 
imparted to the hydrogen nucleus is very large; if the impact were 
exactly central it would be 1-6 times that of the a-particle. Hence in 
the Wilson chamber the hydrogen nuclei behave in the same way as 
the a-particles except that their tracks are somewhat finer. Their 
velocities are found to be distributed in accordance with theoretical 
expectations. 

The really significant feature of this work of Rutherford’s, however, 
was that he established the occurrence of these hydrogen nuclei or 
protons in nitrogen which previously contained no hydrogen whatever. 
Fig. 1 (Plate II) shows the liberation of a proton from nitrogen. The 
short track going sideways from the point of collision is that of the 
nucleus g 0 r i, which consists of what is left after the nitrogen nucleus 
7 N 14 and the a-particle have united and discarded a proton.* Thus 
the process consists not merely in the destruction of the nitrogen 
nucleus, but in the building-up of a new and heavier nucleus, a super- 
fluous proton being ejected. Protons were subsequently found to be 
liberated from a very considerable number of elements, even the 
heavier ones, especially when very fast artificially produced a-particles 
are used. In all these transformations, so far as Wilson chamber 
experiments show, the a-particle remains in the nucleus. 

The fact that in many cases the energy of the protons far surpasses 
that of the a-particles used to liberate them (the range of the protons 
from aluminium is actually equivalent to a range in air of as much as 
80 cm., the range of the original a-particles being only 5 cm.) shows 
that what we are dealing with here is an elementary nuclear reaction, 
involving a transfer of energy (AE). This we shall reckon positive, as 
in thermochemistry, when energy is set free in the course of the trans- 
formation. If we attach the suffixes a, II, h to all quantities referring 
to the a-particle, the proton, and the newly-formed- nucleus respec- 
tively, we have the energy equation. 

Pf a « a a + AE — \M u v* + $M k W- ■ ■ (15) 

To express the conservation of momentum we denote the angles 
which the paths of the proton and of the nucleus make with the path 

* In nuclear physics it is customary to designate a given isotope by attaching the 
atomic weight as an index (upper right) and the nuclear charge as a suffix (lower left) 
to the chemical symbol. Further, a reaction is often written down in abbreviated form 
by putting first the bombarding particlr and then the emitted particle in brackets 
between the initial nucleus and the final nucleus. Here, for example, we should write 
,N I4 (a, p) 8 O l7 . The symbols used are a — He++, p = jH 1 , d = ,H 2 , n = neutron. 
e~ --- electron, e — positron, y — light quantum, v = neutrino, m — meson. 
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of the incident a-particle by rf>, \jj respectively; we then have the 
equations 

M„v h sin <f> — M k v k sin ift = 0 (16) 

M„v u cos <f> + M k v k cos if/ s M x v a . . ■ ■ (17) 


From these we readily obtain 

\M k v k 2 — 2 ^- — 2M a M tt v a v a cos </> + M^ 2 }. (18) 

or, since = AM a , 

\M k »? = ~ {V — 8u H t> a cos <f> + 16u a 2 }. (19) 


Substituting this in (15) above, we obtain an equation which for a 
given value of the energy transfer gives v B as a function of the angle <f> 
and the energy of the a-particle. In many cases the energy transfer 
can be calculated from the mass defect and the distribution of v B is 
in excellent agreement with theory. (The value of v„ is determ ined.by 
the range, that of <f> from stereoscopic photographs.) In some cases 
several groups of protons were found, corresponding to several distinct 
values of AE. This is obviously explained by the fact that when the 
energy of the protons is fairly small the nucleus is left in the excited 
state. The energy of excitation is probably given out later as a y 
quantum; the occurrence of y - rays in nuclear disintegration has 
actually been established. All these observed facts justify the assump- 
tion of the validity of the energy and momentum theorems, as well as 
that of the existence of definite energy states of the nucleus. 

The penetration of the a-particle into the nucleus is obviously 
another case of the quantum mechanics tunnel effect. Here, however; 
the direction of motion is exactly opposite to that in radioactive dis- 
integration, in connexion with which we first encountered the tunnel 
effect (p. 747). The a-particle advances in opposition to the potential 
of the positively-charged nucleus, crosses the potential barrier, and 
reaches the interior of the nucleus. According to p. 747, the prob- 
ability of crossing the barrier decreases rapidly as the difference be- 
tween the energy required and the energy available increases. This 
is the reason why earlier attempts to shatter the nuclei of heavier 
elements with natural a-particles did not succeed; in these, of course, 
the potential barrier is much higher, owing to the higher charge on the 
nucleus. On the other hand, the potential barrier, as was first recog- 
nized by Gamow, is lower for a proton than for an a-particle, owing to 
the single charge of the former; this suggests the possibility of pro- 
ducing sufficiently fast protons to bring about atomic disintegration 
by meanB of potentials available in practice. 



750 


ATOMS, MOLECULES AND SPECTRA [Chap. 

The first of these artificial transformations brought about without 
the use of radioactive substances was the decomposition of lithium by 
protons which had been subjected as hydrogen canal rays to a potential 
of about 100,000 volts, the experiments being made by Cockcroft and 
Walton working in Rutherford’s laboratory. By means of the scintil- 
lations produced on a fluorescent screen they were able to detect the 
presence of a-particles of energy 8-6 x 10® electron volts. The be- 
haviour of lithium, from which the yield is particularly great as com- 
pared with other atoms which can also be disintegrated by protons, 
was subsequently investigated very thoroughly. Wilson chamber 
photographs by Kirchner clearly showed that in each reaction two 
a-particles appear. We accordingly have the equation 

3 Li 7 + rff -f ® = 2 2 He< + 2 . . (20) 

or, in numbers,* 

7-0180 + 1-0081 + 0-0001 = 8-0077 -f 

Using the convenient relation that 1 a.m.u. (atomic mass unit) is 
equivalent to 932 Mev (million electron volts), the above numbers 
show that an energy of 17-2 Mev has been released. The measured 
kinetic energy of both a-particles exceeds that of the initial proton 
by about 17-0 Mev, in excellent agreement with the value computed ' 
by conservation of mass-energy. x 

The fact of the conservation of momentum is also established by 
Wilson cloud photographs. It is found that the two tracks of the 
a-particles going in opposite directions enclose an angle of 5°, the 
value calcinated for protoris with the velocity used being 4° 50'. .The 
discrepancy of 10' is of course less than the experimental error. 

Apart from these 4energv-rich a-particles. the existence of two 
definitely slower groups of a-particles has been established. These are 
the fragments arising from the disintegration of s Li 6 according to the 
reaction 8 Li s (p, a) 2 He 3 . That is, one particle is an “a-particle of mass 
3 ”. Hard y radiation is also observed, arising' from the reaction 
s Li ,? (p, y) s Be 8 . The yield of disintegrations is represented by an ex- 
ponential function, as one would expect in view of the occurrence of a 
tunnei effect. With the lowest potentials (20,000 volts) 10 16 protons 

* Id nuclear reactions we .are of course concerned only with the masses of the 
nuclei, but we actually use the massea of the atoms, i.e. of the nuclei plus the electrons, 
in our calculations. This does not introduce any error, since the number of electrons is 
the same on both sides of an equation representing a nuclear reaction. In problems ot 
stability, in fact, we must carry out the calculations in this way (of. Exercise 136, 
p. 763). 
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are required to bring about one disintegration, whereas with potentials 
of 200,000 volts only 10® protons are required.* 

The disintegration of boron has also been investigated very 
thoroughly. Here, in addition to a-particles of certain definite ranges, 
there is a continuous velocity spectrum having a distinct maximum. 
This is perhaps due to the fact that the reaction takes place according 
to the equation 

5 B u + ^ = 3 2 He 4 + ^r> ■ ■ (21) 


or, in numbers, 

11-0129 -f 1-0081 + 0 0001 = 12-0117 + 3 

<r 

When, as here, there are three fragments the energy and momentum 
theorems are insufficient to determine the distribution among the 
individual particles. The reaction S B n (p, a) 4 Be 8 also takes place. This 
transformation gives rise to particles of definite range (see fig. 2, 
Plate II, facing p. 748), while the reaction of equation (21) corresponds 
to the shorter tracks seen in the picture. 

Particular importance attaches to disintegration experiments with 
fast deuterons produced like the protons as hydrogen canal rays. When 
lithium is bombarded by a mixture of deuterons and protons a fresh 
group of a-particles with the high energy of 12 X 10 6 electron volts 
is produced, over and above the groups liberated by bombardment 
with protons only. On the basis of the mass defect this new group 
corresponds within the limits of experimental error to the disintegra- 
tion of the lithium isotope 6 in accordance with the equation 

a LJ8 + iH 2 + = 2 2 He 4 + 2 . (22) 


or, in numbers, 

6-0169 + 2-0147 + 0-0001 = 8-0077 + 2 

While nuclear transformations by means of fast electrons have 
hitherto been observed in a few cases only, y-rays of sufficiently high 
frequency are more effective. Of particular importance for what 

* For the reaction jH*(<2> p), H* D6pel obtained the surprising result that the lower 
limit is experimentally fixed by the leaBt potential required to produce canal rays and 
not by the nuclear reaction coming to an end. The fact that nuclear reactions do take 
place, if seldom, at velocities as low as those corresponding roughly to the velocities 
of gas molecules in the interior of a star at a temperature of a million degrees at last 
provides us with a source of energy for stellar radiation. 
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follows is the observation that bombardment of materials containing 
deuterium gives rise to protons. But what is the residue ? If from a. 
particle of mass 2 and charge 1 we remove a proton of mass 1 and 
charge 1, we are left with an entity having zero charge and mass 1. 
Here for the first time we meet with a new elementary particle, the 
neutron. Strong evidence for its existence was already forthcoming 
from other directions before the above-mentioned observations were 
made; this evidence we shall discuss in the next section. 

5. The Neutron. 

Up to the year 19&2 it was reckoned as among the incontrovertible 
facts of physics that the positive charge is always associated with the 
mass of hydrogen, the negative with the mass of the electron, and that 
these two particles, protons and electrons, form the ultimate con- 
stituents' of matter. However, in that year, in rapid succession, two 
new elementary particles were discovered, the neutron mentioned 
above and the positive electron, or positron. Of these, the neutron has 
revolutionized our ideas of nuclear structure. 

In 1930, Bothe and Becker observed that beryllium and other light 
elements when bombarded by a-particles, emit not only a hard y-radia- 
tion but also a penetrating radiation with entirely different properties. 
In substances containing hydrogen this radiation sets free protons of 
high velocity, although in the Wilson chamber it leaves no traces of 
itself; that is, the proton tracks, so to speak, start from nothing. If 
we begin by assuming that here we are again dealing with an extremely 
hard y-radiation — owing to the great penetrating power a- and ]8- 
ravs are at once ruled out — we are forced to interpret these protons 
as a Compton effect which has taken place in the nucleus, to which 
the calculations of p. 688 apply if we replace the mass of the electrons 
by the mass of the protons. In this way we obtain the value 55 x 10 s 
electron volts for lie hardness of the y-rays, while according to the 
mass relationships the bombardment of beryllium can only yield a 
much smaller amount of energy. The frequency of these proton tracks, 
moreover, is about a thousand times as great as we should expect 
from the assumed Compton effect. Further, a y-radiation would 
necessarily set free photo-electrons also; no trace of these is to be 
found. Finally, the velocity imparted to nitrogen nuclei by the radia- 
tion is three times as great as would be produced by a light quantum 
of 55 X 10® electron volts, in a Compton effect on the nitrogen nucleus. 

We can extricate ourselves from all these difficulties if we assume 
with Chadwick that this radiation consists of charge-free corpuscles, 
which owing to the absence of electric charge cannot have any 
ionizing effect and hence are not retarded ]>y the outer shells of the 
atom. Such particles do not differ from light quanta, which also have 
a mass hvj&, except that they do not move with the velocity of light. 
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Their collision with a nucleus is taken as being like the collision of 
elastic spheres. This means that the forces between the colliding par- 
ticles decrease according to a very high power of the distance. The 
nature of these forces is still a matter of speculation. If we calculate 
the velocities which a proton and a nitrogen nucleus will acquire in 
colliding centrally (these can both be obtained from their ranges), 
the principle of the conservation of momentum gives the two equations 




v y 


2M n v n ' 
M v + M n 

2 M n v n 
M s + M„' 


(23) 


where the indices p, N, n refer to proton, nitrogen nucleus, and neutron 
respectively; hence 


v, _ M s + M„ 
Vy M„ + M„' 


(24) 


From this equation Chadwick obtained the value 1*15 + 0-1 for the 
mass of the neutron. 

The exact mass of the neutron can be found by comparing masses 
in various nuclear transformations. The- simplest to use is the reaction 
]H 2 (y, wJjH 1 (see top of p. 752), which has also been observed to take 
place in the reverse direction, jH^w, y) t H 2 . We have iH 1 = 1*00814 
and jH 2 == 2*01473, from observations with the mass spectrograph, 
taken relative to 8 0 16 = 16. Also hv = 2*23 Mev and E n is approxi- 
mately zero, so that” 

o n 1 = 1-00898 


Owing to the absence of charge on the neutron, which means that 
it can react only with nuclei, it exhibits very remarkable properties 
when made to pass through matter. As there are more nuclei in 1 c.c. 
of paraffin-wax then in 1 c.c. of lead, beams of neutrons are more 
weakened by paraffin-wax than by an equally thick layer of lead. 
This serves to distinguish between y-rays and neutrons. Further, 
embedding the ionization chamber in paraffin-wax reveals an impor- 
tant effect due to the neutrons, for then beams of protons are shot out 
by the paraffin-wax and ionize the gas in the chamber in their turn. 
Neutrons, however, are detected most readily by the fact that they 
give rise to radioactive isotopes of various elements, in particular, 
rhodium. 

The mass (1*00898) and hence the energy content of the neutron is 
greater than the sum of the masses of the electron and proton (1*00814), 
so that the neutron should be unstable with respect to a breakdown into 
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the other two particles. In fact, such “ radioactive decay ” of the free 
neutron has been observed, the half-life being about 12 minutes. 

Nuclear disintegrations brought about by beams of neutrons have 
been observed in great numbers. Curiously enough, the very slow 
thermal neutrons are often the most effective and very sharp “ res- 
onance energies ” are found. As might be expected, r eutrons are much 
more effective than charged particles, for the majority of the latter fail 
to act because their energy is spent in detaching outer electrons from 
the bombarded atoms. 

We may amplify this last point a little. As we saw in the case of 
the disintegration of lithium, most nuclear processes are highly exo- 
thermic reactions. The question naturally arises why these reactions 
do not spread like the explosion of a mass of gunpowder. The reason 
is that it is only in an infinitesimal number of cases that the energy 
of the charged products of disintegration, though very great, can again 
act effectively in a nuclear collision. The possibility of bringing about 
atomic transformations on the large scale is therefore exceedingly 
small. If neutrons are used the conditions are much more favourable, 
as then the only particles which suffer retardation are those which 
actually collide with a nucleus. The usual means of producing neutrons 
by means of a nuclear disintegration yields only a small supply. If, 
however, there were a process which would set free more neutrons than 
were shot in to bring it about, the possibility of a chain reaction would 
exist, provided that the volume of material is so large that only a few 
neutrons are lost. This possibility was opened up by the discovery in 
1939 by Hahn and Strassmann of the fission of the uranium nucleus. 
Whereas all previous artificial nuclear processes involved the splitting- 
off of elementary particles only, fission causes the uranium nucleus to 
break up into two fragments of intermediate weight and a number of 
neutrons. This will be discussed more fully in § 10. 

6. The Positron. 

Anderson and, independently, Kunze found that when a strong 
magnetic field was used, cloud chamber photographs of cosmic-ray 
particles showed tracks of a new type whose curvature corresponded 
to positive particles, but their intensity to electrons. Confusion with 
protons is ruled out by the fact that proton tracks so strongly curved 
could not have the range actually found, and the possibility of a mistake 
about the direction in which the track was traversed was excluded by 
making the track pass through a plate of lead, when the side where the 
curvature is greatest of course represents the later portion of the track. 
Subsequently a large number of cloud photographs showed that these 
tracks almost always occur along with an electronic track originating 
at the same place; in fact, whole “ showers ” of positive and negative 
electronic tracks are sometimes found starting from a single point 
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(cf. fig. 1, Plate III). These double tracks may also be produced using 
y-rays; however, a photon of over 10 6 electron volts is always re- 
quired (fig. 2, Plate III). Soft y - rays do not produce the phenomenon. 

This minimum value of 10 8 electron volts serves as a hint for theory, 
for it corresponds exactly to the mass of two electrons. It is therefore 
feasible to interpret the occurrence of positive and negative electrons 
as meaning that a light quantum becomes materialized, transforming 
itself into one or more “ electron pairs ”. The conservation of charge 
and energy is guaranteed, but the conservation of momentum cannot 
hold, the reason being that the light quantum of all particles of the 
same energy (rest energy -f- kinetic energy) is that with the greatest 
momentum. For by formula (72) on p. 254 and formula (82) on p. 257, 
the momentum of a material particle of rest-mass m 0 is 


and its energy is 


so that 


« m o v 

Pm v'(i - 


= ™<p 

m V(i - 


V V 1 

c 2 c c ’ 


for the light quantum, on the other hand, 

P, = h {, B, = hv, 

so that 



(25) 

(26) 

(27) 

(28) 


Hence the phenomenon can occur only in the presence of a nucleus 
which will take up the surplus momentum. The small energy transfer 
connected therewith may be neglected. 

Apart from this “ materialization ” of light quanta, however, 
Curie and Joliot have observed that positrons also appear when alu- 
minium is disintegrated by o-particles. In this process protons of 
very great range had previously been obtained as a product. Now 
a process in which a neutron and, subsequently, a positron are emitted 
clearly exists as well. Later, positrons were recognized to be entirely 
normal accompaniments to nuclear changes. 

Now why is the positron found less frequently than the electron? 
Even before the discovery of the positron Dirac had developed a 
generalized wave-mechanical theory of the electron (p. 711)*in which 
the spin appears quite naturally. We could not go into this theory In 
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detail, owing to the advanced nature of the mathematics involved. 
Formerly it was regarded as a defect in the theory that it permitted 
the appearance, even if only for the time being, of positive electrons; 
but these very entities have now been observed. The annihilation of the 
positfons obviously results from their union with electrons with the 
emission df y-radiation. Here the momentum theorem can only be 
satisfied if two or more quanta are produced simultaneously. Obser- 
vations on the passage of y-rays through matter make it probable that 
materialization and dematerialization of the y-quahtum play a part 
in addition to normal absorption. 

7. Artificial Radioactivity. 

Curie and Joliot made the discovery that the emission of positrons 
from aluminium continues for some little time after the bombarding 
radiation is cut off, dying away exponentially. They interpreted this as 
meaning that when aluminium is bombarded with a-rays an unstable 
nucleus is formed which is transformed into a stable one according to 
the laws of atomic disintegration. 

Subsequently this induced radioactivity was found to occur in hun- 
dreds of nuclear transformations. This is not surprising, for in most 
cases the balancing of the masses gives isotopes which differ from those 
already known. They are not to be found in nature because, if they ever 
did exist, they must long since have been transformed into the stable 
forms. 

We can, in fact, understand the nuclear transformations discussed 
in § 4 (p. 747) better if we assume that in every case the bombarding 
particle is captured to form an unstable intermediate nucleus, which 
in these cases has a life too short to measure. 1 

Atoms rendered artificially radioactive have found many important 
applications in medicine and in industry. For example, by administer- 
ing small amounts of radioactive sodium to a patient, its progress' 
through the blood stream can be followed by means of a Geiger counter. 
Similar experiments can be performed with plants in order to find out 
how they assimilate and distribute foods. In engineering, the rate of 
wear of bearings has been determined by previously introducing a 
small amount of a radioactive metal into the bearing metal and then 
measuring the activity of the lubricating oil. 

In all artificially radioactive elements the emission of either Elec- 
trons or positrons is observed and sometimes y-radiation as well. The 
reason for the preponderance of the first two phenomena, collectively 
referred to as ^-transformations, will be made clear in what follows. 
The occurrence of y-rays accompanies many processes. In addition, 
there is a process in which the nucleus captures an external electron. 
One might expect that a positron ejected from the nucleus in a /?+- 
process could combine with one of the external electrons. If this were 
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the case, one would always observe y- radiation of 0*5 Mev energy as 
described above for positron annihilation. However, this radiation is 
never found, and only the K series of the element in question is observed. 
It must then be assumed that in this process a K electron plunges into 
the nucleus and then, as is usual in K emission, the place thus vacated 
is filled by another electron from the L, M, or some higher shell. How- 
ever, in order to preserve the energy balance in such K capture, it has 
become necessary to assume the existence of another kind of particle, 
the neutrino, which has only recently become available to direct observ- 
ation. This will be discussed further in § 13. 

8. The Role of Protons and Neutrons in Nuclear Structure. 

The neutron opens up a new way of explaining the difference be- 
tween atomic weight and nuclear charge, and thus enables us to drop 
the assumption of nuclear electrons. The following are the chief objec- 
tions to nuclear electrons: 

(1) The small magnetic moment of the nucleus, already mentioned 
on p. 744, which according to all the observations is of the order of a 
nuclear magneton, not of an electronic magneton. 

(2) From other observations, the proton and probably every 
elementary particle has a spin of 1/2. Experimentally it is known 
(p. 744) that nuclei of even atomic weight have spins of even number, 
nuclei of odd atomic weight spins of odd number. This is incompatible 
with the assumption of nuclear electrons. For if Z„ is the number of 
protons, the assumption that the nucleus is built up of protons and 
electrons means th it the atomie mass A is equal to Z„ and that Ze, 
the number of electrons in the nucleus, is equal to Z„ — Z. The sum 
of protons and electrons is Z P+e = 2Z„ — Z = 2A — Z. Whether 
this is even or odd, then, depends on the atomic number Z, not on the 
atomic mass A. Oh the other hand, by the laws of combination of 
sums given on p. 669, an even number of particles always gives a total 
spin which is a whole number, and an odd number of particles a total 
spin which is a multiple of 1/2. Hence jH 2 and fN 14 , which have odd 
atomic numbers, would each have a spin which is a multiple of 1/2, 
whereas it is actually found to be 1 in both cases. 

(3) According to the indeterminacy relation, if the electron is to 
be restricted to a space of nuclear dimensions it must have a momentum 
of the order of 6 X 10 -16 gm. cm./sec. and hence an energy of about 
100 Mcv. Thus, although we obtain for the electronic shells term 
values of the energy states of the correct order of magnitude, the 
values in the case of the nucleus are too high by more than a factor 
of 10. 

But how are we to explain ^-disintegration if there are no electrons 
in the nucleus? The following account of the state of affairs may be 
plausible by analogy. If an atom emits a quantum of visible light we 
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do not conclude from this that the electronic shells contain light quanta, 
but on the contrary that the light quantum originates during the 
passage of the electronic shells to a lower energy state. Now the energy 
relationships in the nucleus are such that the energy set free in the 
passage to a lower state is sufficient to cover the rest mass of the elec- 
trons and their velocities. We therefore assume that the emission of a 
nuclear electron is associated with the transformation of a neutron into a 
; proton , and the emission of a positron with the transformation of a proton 
into a neutron. But, as in the case of the emission of light, we cannot 
attempt to describe exactly how the process takes place. The charged 
particle emitted then serves to balance the charges. Instead of the 
emission of positrons, however, an electron of the K shell may be cap- 
tured by the nucleus, in which case only, lines of the K series are 
emitted (p. 654). 

If we assume that nuclei are built up of protons and neutrons we 
obtain the energy of binding as the equivalent of the total mass of all 
the protons and neutrons less the actual nuclear mass.* This is always 
less than the sum of the masses of the individual neutrons and protons; 
that is, when they combine to form a nucleus, energy is set free. Other- 
wise, of course, nuclei would not be stable. The matter may be illus- 
trated in a particularly clear way by means of a graph: Relative to 
a set of axes in three dimensions we plot Z P , the number of protons 
(which is equal to the charge on the nucleus) along the x direction; 
Z„, the number of neutrons (which is equal to the difference between 
the atomic mass and the atomic number) along the y direotion; and 
the (negative) energy of binding downward in the z direction, so that 
the nuclei of least energy lie lowest. In this way we obtain a surface, 
although it is true that we know only a narrow strip of it corresponding 
to the known natural and artificially produced isotopes. But from this 
we can surmise that the surface has the form of a trough falling away 
from the origin. The bottom of this valley contains the stable nuclei 
and at first runs diagonally in the first quadrant of the xy plane, the 
origin being the highest point (fig. 1). 

If we look at the numerical values of the energies of binding we 
see that they run from 0 to 1800 Mev. Since 932 Mev — 1 a.m.u., this 
corresponds to a change in atomic weight of up to 1-9 units. The fact 
that pure isotopes have atomic weights which are approximately whole 
numbers when all the values intermediate between 0 and 1-9 can occur 
is due to the fortunate choice, of O = 16 as a standard of atomic weight. 
As the atomic weights of the proton and the neutron exceed unity by 
0-008 or 0-009, 230 particles will make the difference mount up to 1-9, 
so that the mean binding energy per particle is about 0-0085. Hence, 

* In computing this difference, the mass of the H atom is to be used in place of the 
proton mass, and in any case the mass of the entire atom rather than merely the nuclear 
mass. (Seep, 763.) 
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is the surface falls away fairly evenly (see fig. 2), the atomic weights 
are approximately whole numbers. 

The dots in fig. 1 represent the known stable isotopes. At the start, 
the dots lie on the diagonal line in the first quadrant because the atomic 
weight is twice the atomic number for the light nuclei, i.e. the number 
of neutrons is equal to the number of protons. All points lying in the 



plane Z v = const, parallel to the Z n axis and perpendicular to the 
plane of the paper have the same nuclear charge— they are isotopes. 
All points lying on a vertical plane (Z„ -f Z„ — const.) parallel to th<? 
diagonal of the second quadrant of the xy plane have the same mass — 
they are isobars. 

This method of approach enables us to deduce much information 
about the stability of nuclei relative to the transformations we know 
to occur on the emission of a-rays, electrons or positrons (both of the 
latter referred to as /S-transformations). Only the state of least energy 
corresponds to a system in equilibrium. In thermodynamics it was 
emphasized that it is the minimum of the free energy, not of the total 
energy, that is involved; nevertheless, this distinction is not important 
in nuclear reactions taking place on earth because the thermal energy 
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is negligible compared with the nuclear energy, which means that we 
are practically at absolute zero, so far as nuclear reactions are con- 
cerned. Thermodynamic considerations, however, are quite relevant 
in the interior of stars, where temperatures, of millions of degrees are 
believed to exist. In a-disintegration, where both Z v and Z n are 
diminished by 2, the magnitude of the energy E must satisfy the 
relation 

E(Z V , Z n )>E(Z„ - 2, Z„ - 2) + E(2, 2). . . (29) 

As we may regard the change by 2 units approximately as a differential, 
we may write instead 

2 (st + s|) >£<2 ' 2 >' (30) 

i.e. the steeper the fall of the energy surface, the more assured is the 
stability. If, for the known nuclei, E is now expressed as a function 
of the atomic weight, this corresponds to the intersections of the 
bottom of the valley with the planes Z V A-Z n — const. We thus 
obtain fig. 2. From it we see that, beginning with atomic weights of 
about 120, the inclination of the surface decreases, and a-disintegration 
occurs only among heavy nuclei. 



In order to discuss the stability of ^-disintegrations, the surface 
concept must be further refined. The energies of binding calculated 
from the atomic weights are such that with a more accurate plot we 
can no longer make all the points lie on a smooth surface; the surface 
becomes crinkled. If, however, we join the points for which Z v is even 
and Z n is even (A even), those points for which is odd and Z n is 
odd (A even), and, finally, the points for which A is odd (either Z v or 
Z n odd), we obtain three smooth surfaces which we denote * by E„, 
E w and E uu . Then an isobaric section gives a single curve of inter- 

* It is not necessary to distinguish between E m and 
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section, or two curves, according as it corresponds to an odd or even 
atomic weight. This is shown for an even atomic weight in fig. 3a. 
The E uu curve lies above the E u curve, from which we conclude that 
in general nuclei for which Z v and Z n are odd are unstable with respect 
to /?- transformations , since, of course, the nexh lower point can be 
reached by such a change. Actually there are only four nuclei of this 
type: jD 2 , 3 Li 6 , 5 B i0 , 7 N 14 . The existence of these nuclei is possible 
because at the start the walls of the nuclear energy surface are not far 
apart and are steep, so that we have the stage of affairs shown in 
fig. 36. The favouring of the type ee is rightly associated with pairwise 
cancelling of the spins. As the ^-disintegration takes place in an iso- 
baric section, it is only a question of whether there is a nucleus lower 



Fig. 3 a Fig. 3* 


down in 'the immediate neighbourhood — at any rate, double trans- 
formations with emission of two electrons have not so far been ob- 
served. If we make the section through a nucleus of the type we, there 
are other nuclei at unit distance, which are ^3- unstable since they lie 
higher. Hence if the atomic weight is odd, there are no stable isobaric 
substances. This .is Mattauch’s first law of isobars. 

Since the E uu surface has no stable nuclei beyond 7 N 14 , all the points 
corresponding to stable nuclei of even atomic weight must lie on the 
E ee surface, but they are 2 units apart. That is, a ^-disintegration 
results in passage to the E m surface, with the formation of a new un- 
stable nucleus. From the position of the two surfaces we see from 
fig. 3a that here 3 stable nuclei are possible, while the two next ones, 
with the relationships, we have assumed, have a lower neighbour in 
the E uu surface, i.e. they are ^-unstable. We thus obtain Mattauch’s 
second law of isobars: If the atomic weight is even there are several stable 
isobaric substances, but they are never those for which the nuclear charge 
differs by unity. 

In order to arrive at a statement about the possible isotopes, we 
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project the points of the energy surface and the bottom of the valley 
on to the xy plane. We can immediately omit all the nuclei of type uu 
as unstable (in fig. 4, in which the stable nuclei which actually occur 
are shown by black circles, the unstable nuclei are crossed out). If, 
for example, we consider the isotopes of Br, the two nearest the bottom 
of the valley are M Br 79 and 35 Br 81 . If we describe isobaric sections 
through them, within which /^-transformations take' place, we see that 
they lie lowest in these also, as they are nearest to the bottom of the 
Valley. The nearest possible isotopes, however, lie higher in their 
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isobaric sections than other nuclei, i.e. they are j8-unstable. We thus 
obtain Aston’s isotope rule: elements of odd atomic number (such as 
Br) have at most two stable isotopes. In the same way, we see imme- 
diately from the figure that a larger number of stable isotopes is pos- 
sible when the nuclear charge is even, especially when the E, e and 
E ue surfaces are far apart. A good many of the /3-unstable atoms 
marked by empty or crossed-out circles actually have been found in 
nuclear transformations, so that now we know something of the region 
around the valley bottom.* 

By means of the energy surface we can also explain the remark- 
able “ resonance energies ” mentioned in § 5 as being observed in the 
binding of neutrons, i.e. particularly effective velocities which often 

* See, for example, Sullivan, W. H., Trilinear Chart of Nuclear Species, New York, 
Wiley, 1949. 



NUCLEAR PHYSICS 


763 


XLII.] 

amount to only a few electron volts and thus are of quite a different 
order of magnitude from the nuclear energies. If we go along the 
valley of stable nuclei from one isobaric section to the next, the surface 
falls by about 8 million electron volts. In adding a neutron we are, it 
is true, not moving in this direction of greatest fall, but in the isotopic 
section; however, the new nucleus will in any case always lie several 
million electron volts lower. On the other hand, an excited state of 
this nucleus may come to about the same level as the original nucleus, 
and this energy of excitation corresponds to the order of magnitude 
of the nuclear energies within a few million electron volts. If now the 
approaching neutron has sufficient kinetic energy to supply exactly the 
amount needed to make the energies correspond, binding takes place 
without any accompanying phenomena. The excited nucleus then 
passes to its normal state with the emission of y-rays and there follows 
a ^-transformation to the lowest point of the isobaric section in which 
the new nucleus lies. Excited nuclei, like excited atoms, can be meta- 
stable, and then have a life which is long in comparison with that of 
metastable atoms, amounting even to hours. This explains' the fact of 
nuclear isomerism — the occurrence of nuclei of the same atomic weight 
and the same charge, but with differing lengths of life. 

The representation by means of the nuclear energy surface, although 
very easy to follow, does not give a perfectly accurate description of 
the stability relationships in the isobaric section. It is on the difference 
in atomic weight that the relative stability of two nuclei depends (see 
Ex. 135). Owing to the slight difference between the mass of the hydro- 
gen atom and that of the neutron, however, the difference in energy 
between two nuclei in the same isobaric section is not exactly equal 
to the difference in atomic weight, for in advancing along the isobaric 
section the ratio of the number of protons to the number of neutrons 
changes. Hence in critical cases it is advisable to plot the deviation of 
the atomic weight from integral values over the ZN plane in the various 
isobaric sections. If this is done, however, the various isobaric sections 
can no longer be brought into relation with one another. 

Ex. 135. Show that a nucleus of atomic mass A and nuclear charge Z is 
stable as regards (3-transformation and transformations involving X-capture if 
its atomic weight is less than that of its neighbours Z + l and Z — 1. (Write 
down the equation for the conservation of energy, taking into consideration, the 
mass of any electrons produced or destroyed.) 


9. The Liquid Drop Model of the Nucleus. 

If we really want to calculate the binding energies and the stationary 
states of the nuclei, we must make a model of the configuration of the 
nuclear components. Here, unlike the situation for the atom as a whole, 
there is no central body, so that the planetary model is not appropriate. 
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The similarity between the constituents — the difference between 
charged and uncharged particles does not play a big part here and is 
neglected in what follows — suggests that we take a liquid drop as our 
model. To explain t-he very strong cohesion of this droplet we have to 
introduce a new kind of “ nuclear force ” acting between the con- 
stitutents, no matter whether they are charged or not. The fact , that 
the energy always comes to about the same amount, about 8 Mev 
for each new particle taken in, shows that the forces (like those binding 
a liquid together) fall off rapidly with the distance, so that only the 
interactions with immediately neighbouring particles are important. 
With forces which, like electrostatic forces, diminish relatively slowly 
with distance, the energy of the mutual action between the newly 
arrived particle and all those previously there must be taken into 
account, so that the binding energy per particle must rise rapidly with 
the number of particles already present (see below). In actual fact, 
experiments on the scattering of fast protons by protons have shown 
that for very close approaches there is a deviation from the classical 
scattering formula derived from the laws of electrostatics, which 
proves that, contrary to the Coulomb repulsion, there exists a force of 
attraction which decreases rapidly with distance. 

The droplet model even yields the finer details of the situation, for 
example the existing slight progressive change in the binding energy 
per particle, i.e. the sum of all the particle masses minus the nuclear 
mass, all divided by the nuclear mass, or the quantity SMjA. If we 
think of the constituents of the nucleus as in the closest spherical 
packing, each particle has 12 neighbours, and the energy set free by 
bringing in another particle is 6U 0 , where TJ 0 is the binding energy 
between two particles. From this, however, we have to subtract two 
quantities: (I) a diminution of the binding energy of the particles lying 
on the surface, each of which interacts with only six neighbours, and 
(2) the electrostatic energy of the protons which owing to mutual 
repulsion has the opposite sign to the energy of nuclear attraction. So 
far as the former is concerned, we can consider its variation with the 
number of particles A, as follows: The volume increases in proportion 
to A, so that the radius of the droplet increases as A vs . Its surface, 
therefore, varies as A 2/3 , and the contribution of this quantity per par- 
ticle is proportional to A m [A= A~ 1/3 . 

For the electrostatic energy, the slow decay of the forces makes it 
necessary to calculate the energy of a sphere of radius a having a 
uniform charge density p as in Ex. 80, p. 290. The energy is propor- 
tional to p 2 a 5 = p 2 A 5/3 . Since the ratio of protons and neutrons does 
not change much with atomic weight, we may regard p as constant to 
the degree of approximation sufficing here, and the contribution per 
particle becomes approximately proportional to A 2/s . Hence the re- 
lation-between A and the binding energy per particle is expressed by 
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a formula with three constants, 

- ^ = 6C7 0 - K c A m . . . . (31) 

With the empirically determined values of U 0 , K 0 , and K 0 this formula 
expresses the relation surprisingly well, as shown by fig. 5 (p. 766). 

The left-hand scale gives the binding energy per particle expressed 
in millions of electron volts and in thousandths of an atomic mass unit. 
The right-hand scale is the so-called packing fraction, given in ten- 
thousandths of a unit. This quantity varies in close correspondence 
with the binding energy per particle.* 

The increase in the electrostatic energy has the further consequence 
that, with increasing atomic weight, the lowest energy state of a 
nucleus no longer occurs when the number of neutrons equals the num- 
ber of protons. Instead, for the stable nuclei, the number of neutrons 
increases more and more with respect to the number of protons until 
finally only a-unstable nuclei are possible. 

We shall refine equation (31) somewhat and at the same time put 
it into a form which makes it easier to recognize the meaning of 
the constants. In order to account for the slopes of the sides of the 
nuclear energy surface we must add a term representing the neutron- 
proton ratio. The lighter stable nuclei have N = Z, while for the 
heavier ones there is a progressive increase in the neutron excess. From 
this we conclude that the binding energy is greatest for N = Z, and 
that the deviations in the total energy are attributable to the increase 
in the electrostatic energy. Consequently, we add a term in (N — Z) 2 . 
By analogy with the case of binary solutions in physical chemistry, 
this energy term will depend only on the relative concentrations, so that 
it will take the form b(N — Z) 2 /(N ~f- Z) 2 . Further, we represent the 
nuclear radius by 

r = r 0 A ™ : . (32) 

Finally, we insert a term representing the splitting of the energy surfaces 
attributable to the pairing of the spins. This term is somewhat harder to 
justify theoretically. Using u — 6U 0 , the result is 

x = ~ M ° + b (r+i) + cA ~ vs + A ~ m + ykA ~ m> (33) 

* It may be useful to summarize the relevant definitions at this point. One a.m.u. 
is A of the mass O 14 , and the isotopic mass of an atom specios is its atomic mass ex- 
pressed in a.m.u. The mass number is the nearest whole number to the isotopic mass, 
and all known isotopio masses fall within 01 of the closest integer. The small difference 
is the mass defect (or mass correction). Finally, the packing fraction is the mass correc- 
tion divided by the mass number. 
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y — +1 for uu, 

= 0 for we, 

— —1 for ee. 

The numerical values of the constants obtained after smoothing out the 
small irregularities of the nuclear energy surface are, expressing 8M in 
thousandths of a mass unit, 

u 0 = 15-7, 6 == 22, c = 16-5, 1 

^== 0-646, k — 36. I' (34) 

6r 0 ’ J 

The constant in the nuclear radius formula (32) has the value 

r 0 = 1-41 X 10 -18 cm. ; (36) 

curiously, this is the radius of a sphere over which the charge e fcould be 
spread uniformly to have the electromagnetic energy corresponding to 
the mass of the electron. The “ classical electron radius ” is represented 

by 2 ^ 0 - 

It must be mentioned that the energy «„ representing the binding 
energy of a nucleon involves not only potential energy [represented by 
the negative term in equation (33)] but a kinetic part (positive terms) 
as well. 


10. The Fission of Uranium. 

This process, mentioned at the end of § 6, may be visualized re- 
markably well by using the liquid drop model of the nucleus. Firstly, 
because of the existence of surface energy, the most favourable form is 
the sphere. However, because of Coulomb forces, a more elongated 
shape may become more advantageous inasmuch as this allows the 
charges to be farther apart, on the. average. In fact, a splitting into two 
spheres can result in an increase in energy. As for water drops, supply- 
ing energy can bring about vibrations that result in a necking-in of the 
drop near the middle. The question arises as to when such a situation 
will become unstable, leading to actual splitting into two fragments. 

In order to investigate the stability we develop the functions of 
the radius — surface area, volume, electrostatic energy — according to 
powers of a function of the polar angle 8. With departures from the 
spherical form, r will depend on 8, but we still assume that the drop 
has symmetry of rotation. Instead of the angle 6 itself, we use spherical 
harmonics — simple polynomials in cos 6 (see pp. 819-824). The advan- 
tage is that these functions represent solutions of the equation of 
vibration of the drop. Because of the symmetry of the vibrating drop, 
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/(it — 6 ) must equal /($), and the uneven powers of cos 9 must be absent 
Hence we may write 

= r(l -f- a 0 P 0 -f- a^P 2 (0) + ...), . . . (36) 

where P 0 = 1 and P 2 = £(3 cos 2 9 — 1). Multiplying by 2 it sin 9 d9 
and integrating from 0 to w gives us the value of the surface of the 
deformed drop. The result, carried to the second order, is 

= 4«r«(l + 2o 0 + fa 2 2 ) (37) 

Then the surface energy amounts to 

Eo = cA 2/3 (l + 2a 0 + f®2 2 ) (38) 

In the same way, the volume of the drop comes out at 

F = ^(l+3a 0 + |u 2 2 ) (39) 

Inasmuch as the nuclear “ fluid ” must be Considered incompressible, 

a 0 = —\a£ (40) 

Then, according to equation (38) the surface energy becomes 

E n = cA™( l + fa 2 2 ). ..... (41) 

In the expression for the electrostatic energy, the mean of the reciprocal 
of the distance between charges, 1/r is to be replaced by 1/r'. After 
some computation, this is found to be 

if? = 1 Jr (1 - \A£) (42) 

We now have for the total change in energy 

O r Q V2-2 

AE = AE 0 + AE 3t = g V - ± r J^ a?. . (43) 

If the second term exceeds the first in magnitude, the slightest dis- 
turbance will succeed in splitting the nucleus. The critical condition, 
determined by equality of the two terms, is given by 



and, using the numerical values from (34) and (35), the resulting value 
of the above fraction is about 50. Since Z is equal to about 2.4/5 for 
the heaviest nuclei, the values of the critical atomic mass and critical 
atomic number turn out to be 

A^ fv 312 and Z„ 


125. 
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It appears, then, that actual nuclei having a Z < Z rau will require 
excitation in order that oscillation and consequent necking-in take 
place. The amplitude of this oscillation will determine whether the 
energy state so attained is stable or not. It seems reasonable to express 
this energy of excitation as the product of the surface energy by a 
function of the ratio (Z 2 /A)/(Z 2 /A) crll . The function may be specified 
for the simple special cases of (a) weak charge and ( b ) proximity to the 
critical charge; and by means of skilful interpolation Alvarez and 
Bohr were able to trace the entire course of the function to some degree. 
The results show that the nucleus 92 U 238 requires an excitation energy 
of 6-8 Mev, while 32 U 239 requires 7-1 Mev. According to the formula 
for binding energy, the formation of U 236 by the addition of a neutron 
to U 235 liberates. 6-81 Mev; on the other hand, formation of U 239 by 
the combination of a neutron with U 238 releases only 5*37 Mev. The 
considerable difference results from the fact that U 236 is of ee type, 
while U 238 is ue. Thus the critical factor here is the y-term in the nuclear- 
energy formula, which determines the splitting of the energy surface. 
It turns out that U 238 can be split even by thermal neutrons, whose 
energy is practically zero, while splitting of U 238 requires fast neutrons 
with an energy of about 1-5 Mev. Since the fission products have too 
many neutrons compared with their nuclear charge, free neutrons 
appear in the process and these are able to propagate a chain reaction. 

11. Liquid Drop Treated as a Fermi Gas, 

At the end of § 9 it was pointed out that w 0 is the difference between 
the potential energy and the kinetic energy which is also present in 
the nuclear fluid. A« with the electron shells, we get some idea of the 
distribution of the kinetic energy by treating the nucleus simply as 
a Fermi gas. It is true that the masses are considerably greater, so 
that according to p. 632 it might be doubted that the conditions for a 
Fermi gas obtain here. On the other hand, the density is so much 
greater that this circumstance is more than compensated. Each nucleon, 
and indeed every true elementary particle, must be assumed to have 
spin 1/2, which can appear in either of two orientations. If we differen- 
tiate between protons and neutrons, each cell may be occupied by two 
particles. For a first approximation the difference between proton and 
neutron may be overlooked and each cell of phase space may be occupied 
by four particles.* 

We make a very simple assumption about the shape of the potential 
energy curve, corresponding to the short range of the nuclear forces: 
Let a nucleon have zero potential energy when beyond the nuclear 

* In a more refined calculation, the existence of eleotrostatio forces requires that 
protons and neutrons be assigned to separate cells, eaoh cell being occupied by only 
two particles. 

26 


(B 711) 
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radius r 0 A 1/3 and the constant value — F 0 when inside (fig. 6). This 
situation is called a square well potential. The assumption of this shape 
is better justified than appears to be the case at first glance. 

The spatial part of the phase space is 

* = j r£A (46) 

and the number of nucleons in this region having momentum between 
p and p dp is 

dn—Am. AttjP . (46) 



n 


8x/0 mass units 

L 

-V 0 -34W* 
mars units 

j— •—> 




Fig. 5 



The total number of nucleons is given by 

A —fdn = 16*p - 64rrAr 0 »^; (47) 

from which it follows that 



Since A cancels out, the maximum momentum is independent of the 
number of particles, as long as the number is not so small that statistical 
methods do not apply. The maximum kinetic energy, = p^J2M, is, 
according to (47a), 26-1 when expressed in thousandths of a mass unit. 
Taken in connexion with the experimental fact that the energy of 
removal of a particle is invariably about 8 fin the same units), it follows 
that the depth of the potential well is 34 units, independent of the 
number of particles. Bat the width of the well, r a A v \ changes with 
increasing atomic mass. 

The total kinetic energy is obtained by integrating over all values 
of the momentum: 
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The numerical value is but slightly different from the value of u 0 , but 
with positive sign. Here, then, as in the case of the hydrogen atom, 
the kinetic energy turns out to be about half the potential, but in this 
instance the relation is more a matter of coincidence. 

In the more refined computation which takes into account the differ- 
ing values of potential for charged and uncharged particles, the number 
of neutrons comes out to be 


N 


and the number of protons 


3 h* ™ n '““ 


' 3 h iJj 


The sum must be A. From (47), (49) and (50), 




(49) 


(50) 


(61) 


and the total kinetic 

= irn 

— 3 
20M 

Taking 




2 N . , N-Z , 2Z . N-Z 

A~ 1 + N + Z md A ~ 1 N + Z ■ ' 

and developing in powers of the fraction, we obtain the kinetic 
per particle: 


(52) 


(53) 




+ Z/' 


Thus there appear terms i f the same form as those in the formula 
obtained from the nuclear-energy surface, but with different numerical 
coefficients. The first terra shows that the kinetic energy is exactly 
equal to the negative of the total energy, as already remarked. The 
term in (N — Z) Z /(N + Z) 2 also contains a considerable amount of 
kinetic energy. 
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12. Independent Particle and Shell Models of the Nucleus; Nuclear 
Moments. 

The liquid drop model treated as a Fermi gas accounts in a concise 
way for many properties of nuclei, just as the treatment of the electron 
shells as a Fermi gas accounts for many features of atomic structure. 
However, details such as the mechanical and magnetic moments of 
nuclei fall outside the province of the drop model. In many cases, the 
finer details can be deduced from a model in which a single particle 
is considered and its motion in the schematized potential field of the 
others is studied. The question of nuclear moments is one that comes 
particularly within the, scope of such considerations. * 

The experimental fact that the se-nucleh have zero moment (§2) 
indicates that for pairs of like nucleons (proton-proton or neutron- 
neutron) the angular momenta cancel by setting themselves in the 
antiparallel position. At the same time, the integral values found for 
the angular momenta of the fouf stable mt-nuclei at the beginning of 
the periodic table suggest that the parallel orientation occurs for a 
proton-neutron pair. Apart from these four instances, it seems natural 
to ascribe the mechanical as well as the magnetic moment to the un- 
paired nucleon which is added last. The values of the mechanical 
moment of the ue- or eu- nuclei are relatively easy to determine from 
the hyperfine structure, but it is more difficult to measure the associated 
magnetic moment. Inasmuch as the nuclear fluid must possess kinetic 
energy, part of the moment must be the result of its rotation and the 
rest must come from the spin of the elementary particles themselves. 
The magnetic moment of a nucleus belonging to a diamagnetic atom 
or molecule can be determined by a refinement of the atomic beam 
method (p. 665) developed by Babi. However, this method cannot be 
applied to neutrons. Bloch and Alvarez were able to determine the 
value for the neutron in the following way: A magnetized block of 
iron is used to align (“ polarize ”) the spins in a beam of neutrpns, 
while a second magnetized block serves as analyser. An alternating 
magnetic field, applied in the intervening space, disturbs the alignment. 
This disturbance is greatest when the frequency of the field coincides 
with the Larmor frequency of the precessing moments.* The most 
appropriate unit for expressing nuclear magnetic moments is that 
corresponding to the revolution of a proton with angular momentum 
hj^TT. Owing to the fact that the proton is 1836 times as massive as the 
electron, equation (38), p. 464, yields for this quantity the value 
Us — /W1836 = (5*048 + 0*006) X 10 -24 erg/oersted. *— 

Ha is called one nuclear magneton. The experiments referred to above 

* In quantum language, this would occur when the value of hv is equal to the 
difference in energy of the two orientations in the field. 
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yield the following values for the magnetic moments of the proton and 
the neutron: 

Up = 2-7927fi y : / u„ = — 1-9130/x^. 

In each instance the gyromagnetic ratio (p. 671) is double the number 
above, since an elementary particle has spin 1/2. With regard to the 
signs, the + sign for the proton indicates tha>t the spin and the magnetic 
moment must have the same direction (-f charge rotating), while the 
opposite must be true for the neutron. The sign is determined experi- 
mentally from the hyperfine structure. 
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If there is but one bound nucleon, the resulting magnetic moment 
may be calculated according to the method of p. 670 by considering 
the resultant of the orbital moment l and the spin moment s, both in 
units of A/2 tt, precessing about their resultant, j. Starting with an 
arbitrary angle between l and s, the cosine is expressed in terms of j, 
l and s, with each quantity x 2 replaced by the quantum-mechanical 
square, x(x -[- 1). Finally, the angle is taken to be' either 0 or 2tt, the 
only possible values. Calling the gyromagnetic ratio for the spin g s , 
and leaving the value g t for the orbital ratio open, the result is 
For j = l + i, n = g t l f- 
For j — l ft = g t l - \g s + 

The orbital ^-factor is set equal to unity for protons ; but neutrons, 
because of the absence of charge, can have no orbital moment and so 
<7i must be taken to be zero. 
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Using the above experimental values of g a (5-68 for protons, —3*82 
for neutrons), and plotting the values of /i. as functions of j, we get 
the so-called Schmidt-Schiiler curves on which the observed moments 
should fall. Actually, only a few points fall near the curves themselves, 
most of them scattering over the space between the “ Schmidt limits ”. 
Taking account of errors of measurement, no point falls definitely outside 
this region. The reason for the scatter is to be sought in the fact that 
the unpaired nucleon is not completely independent of the others, as 
.was assumed. 

If we wish to compute the energy of excitation or of complete re- 
moval of a nucleon following the method used for the electron shells, 
we must make some sort of assumption regarding the trend of the short- 
range nuclear forces. A possibility that takes into account the short 
range of nuclear forces is the square- well potential function, mentioned 
on p. 770, for which V is zero beyond a certain distance a and equal 
to a constant value — F 0 within this distance. Using this assumption, 
the simple case of the deuteron will be treated. The Schrodinger equa- 
tion is 

W + ^LE-VirW- 0 (55). 


For the deuteron, M is equal to the “ reduced mass ” (p. 651), \M„. 
For heavy nuclei, M can be taken to be that of the last bound nucleon, 
or M B . Since the magnetic moment of the deuteron turns out to be 
very nearly equal to lie sum of the magnetic spin moments of proton 
and neutron, there is no orbital moment. This means that the quantum 
number corresponding to orbital motion is zero and the ifi function is 
spherically symmetric. If we set 


0 = 


u(r) 


(56) 


we get a simple equation for u: 




(67) 


In order to avoid difficulties with the algebraic sign, we set the negative 
binding energy E equal to —W and obtain 


dH 
dr i 
&v, 
dr 2 


8t t*M 

ii 2 


Wu ~ 0 


( 58 ) 


for r > a. 
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The integrals are 

forr<a: « = .4sin& 1 r 

for r > a : u = Be~ k ' r 

These two solutions must be so fitted that for r — a both ip and dtp/dr 
join continuously (cf. p. 723). The requirement of continuity of ip, 
and hence of u, furnishes a relation between the constants of integration 
A and B. This still leaves one of them at our disposal; but the normali- 
zation condition 

J0*(r) dr — 1, (60) 

which expresses the fact that the probability of finding the particle 
somewhere in the whole of space is unity, fixes the value. 

Continuity of the derivative dtp/dr and thus of dujdr leads to an 
additional condition that does not involve A ox B. This condition may 
be evaluated by the following artifice: 

If two functions / x (r) and / 2 (r) have the same value at r = a, (i.e. 
if f x (a) and also if their first derivatives with respect to r have 

identical values at this point (//(a) = f 2 '(a)), then 

i “ l log/iW L - m- (61) 

Applied to the two solutions (59), this yields 

cot kja = — (62) 

The depth and breadth of the potential-well are unknown, but the bind- 
ing energy of the deiiteron is known to be W = 2-19 Mev. Equation (62) 
does not yield F 0 and a separately, but only a relation between the two. 
Inasmuch as IF is small in comparison with V 0 (of. p. 770), we have, 
from (59), kjki = ^/WjV^ According-to (62), this means cot k^a == 0, 
or kja = n/2. From (59), kya == 2w-v/2MF^A, so that 

r ° = sm (63) 

Determination of the value of A from the normalizing condition and of 
B from the boundary condition on f(r) is of no further interest at this 
point. If we put the value of r 0 in place of a in equation (63), we obtain 
a value of about 50 (in thousandths of a mass unit) for F 0 , which is of 
the right order of magnitude.* 


(k, = j Vmv 0 -wy,} 
(h = -r Vmw. \ 


(59) 
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For higher quantum states the expression for i f> also involves surface 
harmonics multiplied by a function of r. As for atoms, an orbital 
quantum number l is introduced in order to describe the energy states. 
A lengthy calculation yields the following term sequence for the case of 
the square-well potential functions: 

Is 2p 3d 2s 4/ %> 5g 4 d 6ft &* 

2 6 10 2 14 6 18 10 22 2 

The numbers in the lower line are the maximum numbers of nucleons 
that can reside in each state, taking account of the Pauli Principle as 
applied to the two possible orientations for the spin. These numbers 
hold for protons as well as for neutrons, since these are really different 
particles which are distinguishable by their charge. 

The assumption of a sharp-edged 
square well is obviously a bold ideal- 
ization of the actual .ituation. A more 
realistic assumption which is at the same 
time amenable to calculation is the oscil- 
latory potential. 

V = F 0 - \Mo> oV 8 . . (64) 

The depth of the well is characterized by 
the constant F 0 , while the width is the 
opening in the parabola, which depends 
on the value of w Q (fig. 8). While this representation is far from the 
actual situation for large values of r, this is of little consequence because 
the particle is seldom found at these large distances. In this case the 
term sequence and particle numbers turn out to be: 

Os lp 2d 2s 3/ 3p 4g 4d is 5ft 5/ Up 

2 6 10 2 14 6 18 10 2 22 14 6 

A somewhat better approach to the truth is obtained by averaging both 
potentials by taking the mean of each pair of closest-lying levels, which 
are given by the same value of l. 

The spin, which has the effect of bringing about a further splitting 
of the energy levels, has not been taken into account up to this point. 
The spin-orbital interaction is greatest here, so that the situation is 
analogous to the ^'-coupling of the electron shells rather than to the LS 
coupling to the other nucleons, which was taken into account in the fore- 
going treatment by means of the nuclear potential. As in the case of 
the electron shells, the splitting must increase with increasing l. If a 
direct proportionality between the splitting and l is assumed, the partial 
levels of one term occasionally fall between those of another and a new 
grouping resuits which corresponds neither to that of the square well 
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nor the oscillating potential function. There are gaps of considerable 
size in the energy levels following these numbers of protons or neutrons: 

2 8 20 (28) (40) 50 82 120 

A number in parentheses means that, the gap following it is present 
but not as marked as for the others. 

What do these gaps signify ? In the theory of electron shell structure, 
the closed shell of the noble gases is characterized by the fact that 
the energy levels nearest the.ground state are especially far above this 
state; e.g. the electron is particularly tightly bound. For the alkali 
metal immediately following, however, the electron is especially loosely 
bound. In exactly the same way, a nucleus whose number of neutrons 
or protons corresponds to one of the above so-caUed “ magic numbers ” 
is particularly stable and symmetric. This affects all nuclear properties. 
For example, we note the large number of isotopes of Sn (Z — 50). 
The quadrupole moment of such nuclei is very small, increasing with 
the addition of each nucleon (cf.. fig. 5). Thus the magic numbers 
correspond to closed shells, just as do the mimbefs of electrons in the 
outer shells of the noble gases. It must be remarked that 60 neutrons 
as well as 50 protons gives a particularly stable nucleus. This makes 
plausible the unusual stability of “ doubly magic ” nuclei, of which 
a He 4 is the first. 

13. Beta Transformations and the Neutrino Hypothesis. 

The established fact that the energy distribution of the electrons 
emitted in /3-transformations is continuous, is in sharp contradiction 
to all other observations which show that the nucleus, like the elec- 
tronic shell, has a number of definite quantum states of energy. If a 
number of nuclei were to disintegrate with the emission of electrons 
only, the new nuclei would have a continuously varying content of 
energy. How can such nuclei, which occur in the radioactive series, 
emit a-rays all having the same sharp range? Here we have the choice 
between only two courses: give up the conservation of energy for /S- 
transformations, or assume that a particle is emitted which escapes 
observation. So long as there is no other evidence for the latter assump- 
tion, it amounts of course to no more than a plausible way of getting 
around the former disagreeable assumption. There is, however, other 
evidence that another particle is emitted. We saw (p. 744) that, accord- 
ing to observation, nuclei of odd atomic weight have moments which are 
multiples of 1/2. In /3-disintegration the atomic weight is not altered 
but the moment changes by a multiple of 1/2, since the electron spin 
is 1/2 and only a whole-number orbital moment can be added to this. 
This difficulty is surmounted by giving the hypothetical particle, like 
any other elementary particle, a spin of 1/2. Its mass must be much 
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smaller than that of the electron, presumably less than 1/5000 the 
electron mass. In nuclear reactions where such particles must be 
assumed to be produced (e.g. the decay of the neutron) the charges 
of the remaining products are found to balance. This, together with the 
fact that the new particle produced no observable track, indicates that 
it has zero charge. This particle, whose existence had to be assumed in 
order to retain the conservation of energy in a number of observed 
nuclear reactions, was named the neutrino. 

Recent cloud chamber observations of /S-decay, undertaken for this 
purpose, show that the conservation of momentum is not fulfilled for 
the emitted electron and recoil nucleus alone. More recent experiments 
for detecting possible neutrino capture by protons now have yielded 
direct evidence for the existence of the neutrino. 

A further circumstance to be considered is the absence of any anni- 
hilation radiation in K-capture (§7). Meitner performed an experi- 
ment which showed that neutrinos are not simply y- radiation. The 
heat developed in a ^-reaction was measured in a calorimeter which 
was arranged to trap any y-radiation which might have been produced. 
Had the heat been produced by y-rays, its amount would have corres- 
ponded to the maximum energy of the electrons. It was found, however, 
to correspond to the average energy of the electrons. 

Neutrinos have such small interaction with matter that they appear 
to pass right through the solid earth without appreciable absorption. 
The question of what happens eventually to their energy is a very dis- 
turbing one, particulary since it is far from trivial in amount. It is 
estimated, for example, that of the total energy production of a uranium 
reactor, some 2000 kilowatts is carried off by neutrinos. 


14. Mesons. 

Besides the new particles already mentioned — neutron, positron 
and neutrino — there exists a group of particles whose masses lie be- 
tween that of the electron and that of the proton. These particles, 
called mesons, occur both with positive and with negative charge, and 
perhaps even in the neutral state as well. The first indication of the 
existence of mesons was obtained in cloud-chamber work with cosmic 
rays. Tracks were found that were definitely thinner than those of 
protons and they were formerly ascribed to electrons. But according 
to the photographs the loss of momentum of these particles in passing 
through a lead plate is only one-tenth of that to be expected from an 
electron. This difficulty can be avoided by assuming that the particles 
have a mass of about 220 times that of the electron. Yukawa had 
already deduced the existence of such a particle from considerations 
which at first sight seem more formal. For the electrostatic field we 
have Laplace’s equation A<f> = 0. Since nuclear forces must fall away 
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very rapidly with distance, he wrote down an analogous differential 
equation 


A<f, - k z <f> = 0, 


(65) 


which, with k = l/r 0 , yields the solution </> = e~ Tlr ’jr and also gives 
the required short range of nuolear forces r 0 , approximately equal to 
10 _13 cm. If, now, by analogy with the wave equation for electrical 
forces we write down the wave equation 

= 0 (66) 

for the field of the nuclear forces, comparison with the theory of de 
Broglie and Schrodinger shows that this is the wave equation for 
particles with rest mass kh/2irc, which corresponds in value with the 
mass which we have to assume for the meson.* An essential feature 
of Yukawa’s theory is that there is an interaction between mesons and 
light particles which gives rise to transformations of one into the other, 
just as there are transformations of neutrons into protons and vice 
versa. 

Experiments with mesons produced artificially by impact of high 
energy (several hundred Mev) protons to a-particles on heavy nuclei 
have shown that the mesons involved in nuclear forces are more massive 
than the ones originally found in cosmic rays. The heavier mesons are 
called Tr-mesons. They occur with charges of +e, — e and 0. The ir + and 
it~ have a mass of 273m 0 , while the n° has a mass of 262 m 0 . The lighter 
ones, called /r-mesons, do not produce nuclear disintegrations. They 
occur as jx + and fx~ and have a mass, of 207m o . It will be remembered 
that X-ray “ Bremsstrahlung ” is caused by the acceleration of elec- 
trons. In an analogous way, 77 - mesons are conceived to be formed when 
nuclear particles are accelerated. Both the tt- and the ju-mesons are 
unstable. The half-life of a 77 -meson amounts to about 2 X 10~ 8 sec. ; 
that of a /x-meson is about 100 times longer.f The current theories 
seek to account for these and other possible mesons as excited mass- 
states of a single kind of particle. 


* If wo apply non-relatmstic wave mechanics, wbioh we have always used thus 
far, to a particle subject to no forces {0 = 0) and having a velocity v (assumed to be 
much less than c), then with E = comparison of equation (65) above with 

equation (28), p. 696, gives the mass as m = Ichl2rrv. The relativistic wave equation 
(not used elsewhere in this book), which was first developed by de Broglie, has the form 
(66) for a particle under no forces, k* being replaced by This gives by com- 

parison m = lcK\2irc. Since mesons are particles of very high velocity, the relativistic 
equation is called for. 
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15. Brief Survey of Our Knowledge concerning Cosmic Rays. 

As far as the amount of converted energy is concerned, chemical 
processes may be characterized as the “ physics of electron volts ’, 
nuclear transformations as “ Mev physics ” and cosmic- radiation 
phenomena as “ million Mev physics It is to be expected that an 
increase of many orders of magnitude in the energy involved brings 
with it entirely new phenomena. The first thing that had to be done 
in order to explain the nature of cosmic radiation was to distinguish the 
effects of the primary rays from those of the secondary. The effects of 
the primaries must increase monotonically with altitude, reaching a 
constant value beyond the atmosphere. The effects of the secondaries, 
on the other hand, must attain a maximum at a certain height; for 
they originate in the air itself and the primary radiation which causes 
them will be progressively absorbed. 

Apart from a small contribution of secondary particles scattered 
from below, the primary rays can be studied in almost pure form at 
great altitudes. Experiments have been carried out in aircraft up to 
about 10 miles, recording apparatus and photographic plates have been 
taken to heights of about 20 miles in unmanned balloons, and high- 
altitude rockets have carried instruments to over a hundred miles. 
The principal effect of the primary particles is the production of single, 
very energetic collisions which show up on a photographic plate as 
“ stars ” in which many nucleon and 77-meson tracks radiate from the 
point of impact. The number of ion pairs simultaneously produced by 
these particles is sufficient to actuate an electrometer of high sensitivity. 

At the present time, the most important means of investigating 
the primaries is the photographic-plate technique. Here the fefacks are 
produced by fragments of silver or bromine atoms which have been 
disintegrated by the intrusion of a primary particle. What is the 
nature of these particles? They must be charged, since their distri- 
bution is found to depend, on the geomagnetic latitude of the place of 
observation. To-day it can be considered well established that the 
majority of the primary particles are protons. There are also heavier 
particles present, ranging up as far as Fe nuclei, a-particles being 
especially numerous. Because of their greater range, the protons pre- 
dominate at greater depths. 

At the earth’s surface and at moderate altitudes only few primary 
particles are observed, most of the rays being a mixture of quite diverse 
secondary particles. Although very energetic secondary nucleons and 
7r _ -mesons are able to produce small stars, these particles constitute 

• In American publications, the abbreviation Bev is sometimes used to denote 
a unit of 10* electron volts. It must be remembered that the term “ billion ” stands 
for 10* in America and for 10 la in Britain. 
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only a small part of the secondary radiation.* It has been known for 
a long time that the radiations detected at lower levels in the atmos- 
phere consist of two components — a penetrating (hard) component and 
a softer one. The separation of the components and the determination 
of their properties have been the object of an extremely large amount 
of experimental and theoretical work. It has been shown that the hard 
component consists of /x-mesons while the soft component is made up 
of less energetic electrons and y-photons occurring in the form of 
extended showers stretching over distances often exceeding hundreds 
of meters. 

The /x-mesons, formed by the disintegration of w-mesons, have a 
lifetime long enough to permit them to penetrate deep into the earth. 
Their ionizing power is so slight compared with their energy that a 
particle having an energy of 10 11 ev is able to penetrate 500 metres of 
water. The electrons and y-photons begin to appear even at the very 
top of the atmosphere. Here the short-lived neutral w-mesons decay into 
two y-photons. By pair production and Compton processes these give 
rise to fast electrons. The electrons, in turn, produce y-photons when 
retarded, and so the process continues. The result is a cascade shower 
in which the available energy is rapidly shared among a large number 
of particles of low penetrating power. 

Special triggering arrangements are needed to catch the beginning 
of a large shower in a cloud chamber experiment. If the point of origin 
happens to be located in the material in a large chamber, the electro- 
meter will show an abrupt impulse as for an explosive shower (see 
above). Usually only tracks of individual electrons or positrons are 
seen in the cloud chamber, and sometimes /x-meson tracks as well. 

The biggest question of all is that of the origin of cosmic radiation. 
At first it was thought that the fays were due to some mysterious 
instance of conversion of matter into energy. This is known to be false, 
if only for the reason that the annihilation of even the most massive 
atom would not be enough to account for the energies of up to perhaps 
10 17 ev that are encountered. In the light of present experience-with 
particles accelerated in the laboratory, it seems plausible that the cosmic 
particles obtain their energy from stray magnetic fields in space. The 
particles may in part originate in the sun. The magnetic fields, because of 
their enormous extent in space, need not be strong, but might be more 
concentrated in the neighbourhood of certain stars, especially those 
showing strong emission of radio waves. It is still very much an open 
question as to which of the possible mechanisms actually obtain. The 
fact that the observed intensity is the same in all directions is in accord 
with such theories, for the particles are probably under way for thou- 
sands of years before reaching the earth. 

• The ir+-mesons are repelled by the nuclear charge and so do not ordinarily lead 
to disintegrations. 
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Further Matters connected with the Geometric Optics 
oe Light and of Electrons 

1. Condition for Absence of Distortion and its Relation to the 
Condition. 

On p. 406 we deduced the condition governing the point-for-point 
imaging of an element of area normal to the axis with the provision 
that axial points be imaged without distortion by means of wide 
pencils of rays. Inasmuch as the squares of the linear dimensions of 
the area were neglected in this derivation, the resulting sine condition 
guarantees only the imaging of a small area, e.g. the field of view of a 
microscope. But if requirements in the quality of the image near the 
boundaries are less stringent, the range of validity of the result may 
be considerably extended. We wish now to consider the other extreme 
— the imaging of a large field of view by means of narrow pencils. The 
limiting case is that of the pinhole camera, which uses no optical unit 
(such as a lens) at all. The sharpness of the image depends solely on 
the size of the aperture. We need not use such narrow bundles of rays 
if we use an optical system, but they must still be fairly small if other 
errors such as astigmatism (p. 409) are to be minimized. It is a chal- 
lenge to the skill of the optical designer to keep these other defects 
small so that good images result even when a large field of view and 
wide pencils are employed. We shall not concern ourselves with these 
other errors. 

Let the cross-section of the entering bundle of rays be limited by 
a diaphragm AB, fig. 1, assumed to be placed in front of a lens system 
S, indicated by the two curved surfaces. In this case AB is also called 
the entrance pupil. If there happen to be lenses on the side toward the 
object, the entrance pupil is found by locating the image of the dia- 
phragm as formed by these lenses. The reason is that a ray passing 
through the edge of the entrance pupil must then pass through the 
edge of the diaphragm, since entrance pupil and aperture correspond 
to each other. In the same way, the boundaries of the pencil of rays 
on the image side are determined by locating the image of the dia- 
phragm as formed by any lenses lying on this side of it. This image of 
the diaphragm opening ic called the exit pupil and is represented by 
A'B' in the figure. 
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versing affairs now, and placing the distorted grid at PQR, its image 
at P'Q'R' will be rectangular. This was used by Abbe as a sensitive 
test of the sine correction of microscope objectives. He placed the 
entrance pupil AB — a small hole — in the object plane of the objective 
to be tested, and noted whether the image formed at P'Q'R', the plane 
for which the figure was computed, was free of distortion. Fig. 2 
shows one quadrant of such a test figure. 



Fig. a 


2. Focal Length of an Electron Lens. 

The parallelism between ray optics and the mechanics of particles 
explained on p. 693 shows that in representing the orbit of an electron 
in an electric field by a ray of light in a homogeneous medium we must 
think of the medium as having an index of refraction proportional to 
the speed of the electron. This follows from a comparison of equations 
(15) and (16). As long as we confine attention to geometric electron- 
optics, the factor of proportionality is immaterial and equation (23) 
is suitable for a wave-mechanical treatment. If the electrons owe their 
velocities only to electric fields, i.e. if their initial thermal motion can 
be neglected, then n may be set equal to the square root of the potential 
difference V that exists between the region considered and the cathode. 
As long as only rays near the axis are considered, a rotationally sym- 
metric electric field (fig. 3) corresponds to a series of centred spherical 
surfaces as already treated on p. 405. The sole difference between this 
case and ordinary optics is that here the index of refraction changes 
continuously, whereas in optics spherical surfaces usually separate 
regions in each of which the index is constant but different from one 
region to the next. This discontinuity does not introduce the slightest 
difficulty as long as the distance between shells can be neglected. This, 
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of course, makes the result a fairly rough approximation. Successive 
application of equation (25), p. 405, to a series of k layers bounded by 
centred spheres gives, for the power , 


On the other hand, from equation (21), p. 405, the power of a 
spherical surface of radius p and discontinuity of index amounting to 
An* is » 

(3) 



The integration may be performed between the limits — oo and -foo 
because dVjdx vanishes in the field-free space and there is no con- 
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tribution except where dV/dx differs from zero. The quantity V ' de- 
notes the constant potential that exists at a great distance beyond the 
lens. 

The radii of curvature of the equipotential surfaces may be found 
in terms of the change of V measured along the axis by using dif- 
ferential geometry and the potential equation AF = 0. This fact 
brings out a fundamental difference between ordinary optics and elec- 
tron optics which complicates the process of computing electron lenses. 
In the case of light, the curvatures and indices of refraction of lenses 
can be selected independently, and so by skilful choice the lens errors 
can be compensated. In electron optics, however, the radii are already 
determined by the change of index along the axis. The result of the 
somewhat cumbersome calculation, which may be transformed by 
partial integration, takes the following forms, since dV/dx vanishes 
at great distances: 



As we can see by considering fig. 8, the repeated neglect of the 
distance between lenses must result in a considerable total error, but 
the optical computation taking these distances into account becomes 
very troublesome. Generally, in electron optics, the calculation of 
electron paths leads more quickly to the result than does the formal 
comparison with ordinary optics. Moreover, the former has the ad- 
vantage that all the required equations are already at hand. For this 
reason we shall compute the focal length only in this way. As ah 
approximation for narrow pencils we equate the projection of the 
orbital velocity on the axis to this velocity itself. Then 



where F is the potential referred to the cathode. 

Let the radial component of the field at a distance r from the axis 
be denoted by E r . Then, since 

dr dr dx dr 

di dx dt ~ dx* 

we have 



• • (9) 
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In a charge-free space, E r can be expressed in terms of E m , the com- 
ponent of the field along the axis. Imagine a cylinder of radius r and 
short length l described about the axis, and calculate the surface 
integral of the field over this cylinder. It may usually be assumed that 
there is no charge within the cylinder since the space charge due to the 
electrons is negligible. Then the surface integral is 

W + or *r — . (10) 


The differential equation of the electron orbit then becomes 


Vv 


4 (^ 4 ) 


rd*V 
4 dx 2 ' 


( 11 ) 


Examination of this equation shows that: 

1. The specific charge ejm does not appear, so that the result holds 

for charged particles of any kind. 

2. The equation is homogeneous in F, so that changing the poten- 

tials of all electrodes by the same amount leaves everything 
the same. This shows why an electrostatic electron micr'- 
scope is not sensitive to variations , in voltage. 

3. The equation is homogeneous in r and x. Thus, if all the dimen- 

sions of an electron microscope are changed by a given factor, 
the magnification remains unchanged. 

In order to integrate the equation of the orbit, write it in the form 


r(x x* r» *rw * 

1 ' da? ^ ® dx dx 




( 12 ) 


This shows that we are dealing with a second-order linear differential 
equation whose cofficients are functions of x and may be determined 
by measurement or by calculation. We can bring the equation into 
simpler form by putting 

r = R(x)f(V) (13) 


determining /(F) so that the term in dr/dx vanishes. The calculation 
readily gives 

m -f-» (i4) 


-I eV.(S)- 


F(x)R. 


(15) 


The integration may be carried out by successive approximations, as 
was done in the case of equation (36), p. 101. 

In order to compute the fooal length, oonsider a ray in tbe object 
space, parallel to the axis and located a distance a from the axis. Its 
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intersection with the axis in the image space determines the principal 
focal point F'. From the definition of focal length (p. 399), 



Since the focus lies in a region of constant potential V", (13) yields 

(£X = (f)/'-‘ *> 

In front of the lens we have the constant potential F 0) and so 

o = 2? 0 F 0 -i (18) 

If we now wish to compute (dR/dx),. from equation (15), we can, as a 
first approximation, take the value R — aV^ which holds to the left 
of the lens. By integration we then get 

where the upper limit may be taken to be infinity because the foous is 
in a field-free region. The power of the system then becomes 



For a single lens the potential has the same value on both sides, and 
a better expression for the power of a single lens is obtained by re- 
placing dV/dx by E: 

</>'=> &f + J*?dX, ( 21 ) 

Similarly, the power of a len3 formed by a short, rotationally symmetric 
magnetic field H is found to be 

< l '=8Sr£>' fa 

V being the potential through which the electrons have fallen. 


( 22 ) 
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PIEZOELECTRICITY AND ITS APPLICATIONS 


1. The Phenomenon of Piezoelectricity. 

When an ionic crystal is deformed an electrical volume-polariza- 
tion results, and this, according to the discussion on p. 281, makes itself 
known by the appearance of charges on the surface. In a rough way, 
the occurrence of this piezoelectric polarization may be pictured by 
t hinkin g of a regular tetrahedron whose centre is occupied by an ion 
carrying four positive charges, while each 
comer is occupied by a singly charged 
negative ion (fig. 1). Inasmuch as the 
centre of gravity of the negative charge 
coincides with the position of the positive 
charge,, the structure as a whole has no 
dipole moment. If, now, the central ion 
is held in position while the top point 
is pushed downward, a dipole moment 
will obviously arise. In simple crystal 
lattices such as NaCl there is no resul- 
tant dipole moment per unit volume, since the symmetry of .the lattice, 
insures a mutual compensation. The fundamental requirement for 
the appearance of piezoelectricity is the presence of what is called a 
polar axis — an axis such that the plane normal to it is not a plane of 
symmetry of the crystal. The relation between the deformation and 
the resulting polarization is linear. We can therefore write for the 
components of the polarization vector: 



Px = Yu e 11 + 712^22 + 7l3«33 + 7l4«23 + 715631 + Yie«a>' 

Py — Yn^i + Yz a e 22 + Yza^ i + Ya e z3 + 726% + 72«%>> 

Px — 731% + 732 e 22 + 733% + 734% + 735% + 736%* 


(1) 


The e ilc are the components of the strain tensor as defined on p. 164; 
the coefficients y ik are called the piezoelectric constants. Depending 
upon the degree of symmetry of the crystal, the actual number of 
constants is reduced. For quartz — the most important of such crystals 
in practice — the scheme of coefficients referred to a co-ordinate system 
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conforming to the axes of symmetry (fig. 2) has the following ap- 
pearance : 

yu “ Yn 0 Yu 0 0^ 

0 0 0 0 — y M — y u L . (2) 

0 0 0 0 0 0 J 

In the place of the components of the strain tensor one may intro- 
duce those of the stress tensor.* In a crystal, however, the relation 
between the two tensors is much more complex than in isotropic 
bodies, where two elastic moduli — for example, E and a — give the com- 


f 



(a) (b) 

Fig. a 


plete connexion. In the general case, the symmetric scheme of co- 
efficients relating the six components of the stress tensor with the 
six coefficients of the strain tensor contains 21 different constants. 
Expressing the e ilc in terms of the P ik by means of these relations, we 
get: 

— P x = 8uP u + 8 n P ^ + Sia-Pas + ^14-^23 + ^15^31 + ^le^ia* 

— P » = ^21 P 11 + S22 P 22 + 33 + 8 2 i P w + + SggPja, ( 3 ) 

—Pm — S 31 Pn + Ss&Pm + S33P38 + 834 P23 + 835 P 31 + S 36 P ia . 

The 8 ik , which may be computed from the y tk and the elastic constants, 
are called the piezoelectric moduli. 

* Sinoe the discovery of piezoelectricity by J. and P. Curie it has been the praotioe 
to work with the pressures rather than with their opposites — the tensions. In what 
follows, the Pa represent the components of the pressure tensor, and this is the reason 
for introducing the negative signs in (3), following. 
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Reciprocally, if an electric field is applied to a piezocrystal, the 
resulting polarization causes elastic stresses and deformation. When 
the components of the strain tensor are expressed in terms of the com- 
ponents of the field strength, the coefficients 8 {k appear. The deduction 
is similar to that of equation (56), p. 176. We calculate the expression 
for the energy of the crystal in the electric field. The results for the 
components of tile strain tensor are 

= &U E m -f- Sjj E v -f- 8 al E g , && — S 14 J?a, + 824 E y + S34 E „, 1 

e aa = ^12-®* + + S 38 E x , % = 8^2?* -j- 8 2a E v -f 835 E„ i (4) 

633 = + 823-#* + 833^*, £12 = 8 l 6 E x -f- 83a -f- 6 M E g . j 

For the negative pressure components a scheme analogous to (4) is 
obtained. The quantities y ik appear as coefficients of the E f ; for 
quartz we have: 

— P u = y u 5a, — P 2 s~y li E !ei % = 8 n E x , ®23 — S 14 i^ x , "j 

P22 — P31— Yufiv> e 22= —^n^m «3i=“Si 4 P», 1(5) 

P33 = 0, Pia~yuP»> ®33 — 0, Cig — — 28 11 fi’ tf . J 

The position of the co-ordinate system underlying these relations is 
that of fig. 2. For quartz, the numerical values in C.G.S. units (alge- 
braic signs for right-handed quartz) are : 

yu = 4-77 X 10 2 * 4 * * * , 8 U = 6-36 X 10~ 8 , 

y 14 = 1-23 X 10 4 , 8 14 = -1-69 X 10-®. 

2. Application of Piezoelectricity to the Stabilization of Oscillating 
Circuits. 

The most important application -of piezoelectricity is based on the 
resonance between the frequency of an oscillating circuit and the 
mechanical vibration of a bar or plate cut from a piezocrystal. Let us 

consider the simplest case of a bar-like piece cut from a crystal as shown 

in fig. 2. Suppose that the electric potential is applied to the long sides 
of the bar, so that the field is in the direction of the x axis. As a result 
there will be changes in length in the y direction and at resonance the 

bar will experience strong longitudinal vibrations. We must first 

compute the natural vibration of a bar of length l. This can be done 

in a way quite analogous to the calculation of the vibration of a string 
carried out on p. 183. Here again we can treat the natural vibration 
as a standing wave resulting from the superposition of two waves 
travelling in opposite directions. The speed of propagation is given 
(p. 183) by the square root cf the ratio of the elastic constant to the 
density. For the crystal, we must use the elastic modulus correspond- 
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ing to tlie direction of the length of the bar. The boundary conditions 
here are just the opposite of those obtaining for the string. The greatest 
amplitude occurs at the free ends of the bar. This result may be de- 
duced formally, but is also intuitively evident, since at the ends the 
motion of the bar is completely unhindered, whereas in the case of the 
vibrating string the opposite is true. Thus the fundamental vibration 
of the bar must have a loop at each end and a node at the centre, so 
that — as for the string — the length of the bar is half the wave-length 
and the frequency of the fundamental mode is 



An important characteristic of any standing wave is that all par- 
ticles belonging to a given loop vibrate in the same phase, so that in 
what follows the system may be replaced by two masses, located at the 
ends of the bar, each of half the bar’s mass. These must be imagined 
to vibrate in opposite phase and be subject to a restoring force of such 
magnitude as to reproduce the natural frequency of the bar. We can 
then apply the much simpler relations for the forced vibrations of a 
particle (p. 96) to the motion of either half of the bar. If there were 
no damping at all, the amplitude at resonance would become infinite. 
In reality, however, there is always at least a small amount of damping 
present, due both to the resistance of tbe air at the surface of the bar 
and to a certain amount of internal friction. The latter is not well 
understood in terms of atomic theory even for single crystals. It is 
quite possible that it might vanish altogether for a perfect crystal 
lattice at absolute zero. 

, For what follows, the important thing is the phase relation of the 
driving force to the vibrations of the bar, or of the vibrations of the 
two particles which replace it in the above sense. The phase relation 
is given by fig. 7, p. 99. A very graphic representation is obtained by 
combining both curves of the figure into a single one by the expedient 
of expressing the amplitude a of "either particle as a function of the 
phase angle <f>. For this purpose, eliminate the quantity — to® from 
equations (S3") and (33'"), p. 98, by using the relation 


sin 2 <ji 


tan *<f) 

1 + tan a f 


(7) 


In the notation of p. 1 


i, the result is the polar equation 
F 0 . , 

0 = 7^ SUL0 

P“> 


(8) 


If the damping constant is small — as it is for quartz — the region 

in which the amplitude differs markedly from zero is exceedingly small. 
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As a result, a> may be taken to be constant in this region, and so equa- 
tion (8) will represent a circle. Since the displacement lags behind the 
driving force, the circle must be drawn below the polar axis if the 
driving force F is drawn along this axis, as shown in fig. 3. This circle 
then represents the index diagram for the periodic quantities F and u, 
where the latter represents the displacement of one of the particles. 



If an electrical potential of the form V — F 0 e‘“‘ is applied, the driving 
force is given by 

F~ — yn&Cp (9) 

where b is the width and c the thickness of the bar (refer to fig. 2j. 
The resonance amplitude becomes 

aWt== jSo> ( 10 ) - 

Since the damping depends on the electrodes as well as on small in- 
homogeneities of the crystal, is not to be regarded as a material con- 
stant. Rather, the width of the resonance curve is used to characterize 
the damping of a given crystal. 

The polarization P follows the deformation u without lag, so that 
P has the same direction as u in the index diagram. The displacement 
current is given by dPjdt = icoP, so that it leads u by tt/2. If the bar 
is allowed to pass along the resonance curve by increasing the driving 
frequency from zero, then u will have the same direction as V at low 
frequencies, but the index of the displacement current will be in the 
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direction of uoV. Thus the crystal behaves like a pure capacitance, 
where the current leads the voltage by tt/2 (p. 318). As the resonance 
point is approached, the magnitude of u Increases greatly while its 
index changes to a position it/2 behind F as the resonance point is 
traversed. At the same time, the direction of the index of the displace- 
ment current comes into coincidence with that of F. The essential 
result, then, is that at resonance the system behaves like a small ohmic 
resistance— small because the index of the displacement current 
becomes very large. The capacitance is effectively short-circuited, i.e. 
it has become infinitely large. When the resonance point is passed, 
u falls even farther behind F, the displacement current falls behind F, 
and the system behaves like an inductance or — what is formally the 
same thing according to equation (23), p. 318 — like a negative capaci- 
tance. 

In these considerations we have neglected the usually small capacity 
of the quartz condenser and the displacement current determined by 
it, since at the critical condition the latter is small compared with the 
piezoelectric displacement current. Imagine a quartz vibrator con- 
nected in parallel with the condenser of an oscillating circuit, the latter 
being variably so that the circuit can be tuned to any impressed fre- 
quency (fig. 4). In the absence of the quartz crystal, the capacitance 0 
in the relation 



would have to be made smaller and smaller as the frequency is increased, 
and the curve connecting C and to 0 2 will be an ordinary hyperbola. 



However, as the resonance point of the crystal is approached, the 
capacitance of the condenser is increased by that of the quartz, so 
that 0 must be lower than it would be in the absence of the crystal. 
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At resonance the capacitance of the crystal would become infinite, 
making it necessary to give C a negative value large enough to make 
the sum correspond to <e 0 . 

In practice the ideal conditions are not fulfilled and the capacitance 
of the crystal, while large, remains finite. The farther course of the 
curve follows from the same considerations (see fig. 5). The stabilizing 
effect is explained in this way: In order that to 0 2 change but little in 
the neighbourhood of the quartz resonance, C must change consider- 
ably. In other words, a small change in the value of C — as a result of, 
say, a change in temperature— will have practically no effect on the 
natural frequency of the system. 
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Space-charge Effects in Gaseous Discharges 

1. Characteristic Carre of a Thermionic Tube. 

Let us consider an extended cathode, which emits electrons, placed 
opposite an extended anode, and let us inquire about the relation 
between current and voltage as qualitatively represented by fig. 1 on 
p. 433. Because of the indefinite extent of the electrodes, the problem 
becomes one-dimensional and the results will depend only on a single 
co-ordinate x. Further, we may neglect the initial thermal motions of 
the electrons, whose speeds have approximately a Maxwell distribution 
(p. 595). With this assumption, the current should be expected to 
begin only when the second electrode is made positive, while in fact 
a minute current is found to flow even when there is a small opposing 
potential. 

Under the above assumptions, the values of the potential between 
the electrodes will be investigated. It will no longer be linear, but will 
be distorted by the presence of the 3pace charge. We have the follow- 
ing equations at our disposal: The speed of the electrons at any place 
where the potential with respect to the cathode is 7 is given by 

$*»«* = eF <1) 

If p represents the charge density, the relation given on p. 418 yields 
for the current density * 

i — pv. (2) 

Space charge and potential are connected by the relation (p. 267) 

AF = 4w/> (3) 

The reason for the positive sign of the right side of the equation will 
be understood from the footnote. 

The current density i is constant throughout the space because we 
assumed the situation to be independent of any but the x co-ordinate 
and the current is constant at all points in the circuit. With the help 

* Contrary to general practioe, we here take the positive direction of the current 
to be that of the electron motion. 
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of (1) and (2) we can eliminate p and v, and obtain a differential equation 
for V: 

d?V 4 iri iiri 

vr 

Multiplying both sides by dVfdx allows us to integrate at once, and 
if the potential of the cathode is arbitrarily taken to be zero, the result 

©■-©:—./(©> " 

Under the assumed condition of zero initial speed of the electrons 
there would be no current if the potential of the cathode were to de- 
crease, for then all the electrons would be driven back. On the other 
hand, if (dV /dx) 0 5* 0, all the emitted electrons would arrive at the 
anode and this would constitute the saturation current. Hence the 
only possibility is to set (dV /dx) 0 — 0 for the region in question. Ac- 
tually, because of the initial speed of the electrons, the lowest point of 
the potential curve lies somewhat in front of the cathode, but the above 
assumption greatly simplifies the computation and does not alter the 
final result markedly. With this condition assumed, we get 

< 6 > 

and by separating the variables we can- integrate a second time, obtain- 
ing 

/QjnX 1 /* 

|173/* = (8irt)W — j * or Vocx i/s . . . (7) 

The constant of integration is zero because the potential was assumed 
to be zero at the cathode. The relation gives the variation of potential 
between the electrodes, as influenced by the space charge. If V a is the 
anode potential and a is the separation of the electrodes, this becomes 
,• _ V 2 V(efm) F3/2 /ftX 

9na z ° ' ’ 

This well-known “ three-halves power law ” cannot hold, of course, 
for arbitrarily large currents. As soon as i reaches the saturation value 
there is no further increase, since electrons are being liberated from 
the cathode at their maximum rate. If the initial electron speed had 
not been neglected, the F® /z curve would not appear to change sharply 
over to the saturation line but would have the rounded junction found 
in practice and pictured on p. 433. 
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2. Oscillations ot a Plasma. 

A plasma is a gas having no average space charge and located in an 
electric field. An example is the positive col umn of a discharge tube, 
where the linear variation of the potential (see fig. 3, p. 437) testifies 
to the absence of space charge. A plasma, then, is a mixture of elec- 
trons, ions, and normal and excited atoms. High-frequency oscillations 
can take place in a plasma, as will be shown for the one-dimensional case. 

Let the field strength be E$ and the number of electrons per cubic 
centimetre n 0 in the non-oscillating plasma, and call these same quan- 
tities E and n, respectively, when oscillations are taking place. Since 
the current is practically all carried by the electrons (cf. p. 439), we 
need only consider the motion of these particles. The following re- 
lations are at hand: 

1. The equation of motion of the electron: 



2. The space-charge equation: 

H = — 4we(n — n 0 ) (10) 

3. The equation of continuity for the electron flow, which corre- 

sponds to the hydrodynamical equation (cf. p. 196): 

-g=4l w 

4. The first field equation, which we write in the form given on 

p. 466, with the sum of the “ displacement ” and convection 
currents designated by 8 (scalar value /S) : 

dE 

(c curl H)a- = 4777S — — iirenv. . . . (12) 

Actually, (11) is implicit in (12). Since the divergence of 
g = o curl H vanishes, we must have 


On the other hand, it foDows by differentiating (10) with respect to t 
that 02g_ ^dn 


b again obtain the equation of continuity. 
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A vibrational differential equation is obtained for the electron 
velocity as follows : Multiply (10) by v and add the resulting equation 
to (12). Then, taking account of the meaning of the differential pro- 
cesses djdt and djdt (cf. p. 195), there results 


3E . dE dE . _ , . 

W + v te = '* == ' bTS + * 7TenoV - 


dt 


Using (9), we get 


<Pv 4-7re 2 n 0 


(13) 


(14) 


Now S is not only constant in space but in time as well. For since 
the applied field E — E 0 is due to charges, and not to changing magnetic 
fields, curl E = 0. But the field equations give 


curl curl E = 


4 ? t d8 
c 2 St’ 


so that for curl E 


0, 9S/3t must also be zero. The particular integral 


A 

n 0 e 


(15) 


represents the mean electron current on which is superposed the 
electron oscillations given by the general integral 


v — A — a), where o> 


*%/(?> 


. (16) 


This simple and clear deduction of the oscillations of a plasma is due 
to Rudolf Seeliger. 



CHAPTER XL VI 


Theory of Elastomers 

1. Behaviour of High-polymer Chain Molecules. 

As a class, the rubber-like solid substances differ greatly from other 
solids in their mechanical and thermodynamic behaviour. In the 
phenomenon of expansion, for instance, one must look to quite different 
molecular processes for an explanation. Apart from the fact that 
rubber-like substances are more resilient by many orders of magnitude 
than are other solids, their isobaric coefficient of expansion is negative.* 
Characteristic of all rubber-like materials is the fact that chemically, 
they are high polymers consisting of long chain molecules made up of 
thousands of identical units. An example is polyacrylio acid, a portion 
of whose structural formula is shown in fig. 1. The molecule is so long 


I 

H i 


l 

u, 

i 

k)OH i COOH H COOH ll 60OH it l 

lOOI 


that probability considerations can be applied even to the parts of a 
single molecule. Because of the rotational freedom allowed by the 
simple C — C bond, two adjoining units can, within limits, turn freely 
with respect to each other. The limiting angle is determined by the 
structure of the units themselves. After a large number n of units, the 
net effect of the many arbitrarily positioned units is such as to make all 
orientations of the nth member equally probable. The corresponding 
length is called the statistical chain element A. Thus the giant molecule 
is schematized as a chain of N such segments whose members may be 
oriented arbitrarily with respect to one another. The internal energy 
of the molecule is consequently independent of the form of the chain, 
which is governed only by the laws of probability. If a perfectly flexible 
chain is thrown on to a table, it will usually end up in an irregular, 
coiled configuration; the chance of its assuming the fully-extended 
position is extremely small. We shall characterize the form solely by 

* This can be demonstrated very simply bv hanging a heavy weight by means of a 
length of rubber tubing. The tube is originally filled with cold water, which is then 
gradually replaced byhot. Contrary to expectation, the tube shortens. 

80S 
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specifying the straight-line distance between the two ends, which we 
designate h (fig. 2). If the beginning of the chain is held fast, a proba- 
bility calculation gives the following expression for the chance of 
finding the end of a chain consisting of N segments somewhere within a 
spherical shell whose radii are h and h -f- dh: 


W(h)dh = Ke-*ST'h*dh ( 1 ) 

The approximate nature of this expression will be realized when it is 
seen that values greater than NA, the fully-extended length, are not 
excluded, although their probability is given as very small. However, 
this expression is adequate for our purposes. Suppose a force F is 
applied to the end of the chain in the direction AB (fig. 2). If this 
force succeeds in increasing h by an amount dh, the 
work done will be F dh. According to the first and 
second laws of thermodynamics, we have for a re- 
versible, isothermal change of state 

TdS + Fdh = dU. . . (2) 

However, since the internal energy U is indepen- 
dent of h, the right member of the equation is 
zero and the force is given by 

f =- t 0S),- • • • < s > 

In order to compute the entropy, we need only apply the Boltzmann 
equation to the probability formula. The probability that the length of 
the line AB, drawn in the direction of F, shall be between h and h -+- dh 
is proportional to 



e' ZA '/i 2 dh. 

The factor i-rrh* does not enter, since the end of the chain is no longer 
considered to move on the surface of a sphere, but is restricted to the 
direction of F. We then get 


and 


S 


3M 2 , 

WI* + CODSt ‘ 


F = 


3 lcTh 
NA ■■ 


(4) 

(5) 


Thus the stretching force F corresponding to a given h increases with 
the temperature, or the equilibrium length h corresponding to a given 
tension F decreases with increasing temperature. We saw (p. 575) that 



XLVI.] THEORY OF ELASTOMERS 805 

for a crystalline solid the elastic forces are determined by electrical 
forces between the molecules. For an elastomer, on the other hand, 
the elasticity is caused only by thermal motion which makes a coiled 
configuration of the molecule more probable than an extended one. 

Now it is obviously not possible to experiment with an individual 
molecule. A collection of such giant molecules is not a piece of solid 
rubber but a viscous liquid, since the individual molecules- are free to 
move relative to each other. This is true for the latex and for the 
sticky raw rubber precipitated from it. Only by the introduction 
of sulphur atoms (vulcanization) are the individual molecules tied 
together, permitting us to apply — at least qualitatively — the results 
obtained' for an individual molecule to finite pieces of technically-pro- 
duced rubber, 

2. A Model of Muscular Action. 

In the contraction of a muscle, nature is able to do something that 
we have succeeded in doing only indirectly — the immediate conversion 
of chemical into mechanical energy. Since the amount of free energy 
thus convertible differs but little from the total energy (cf. p. 553), 
the efficiency of the process is very high. In the laboratory we can 
convert the free energy of a galvanic cell into electrical energy which, 
in principle, could be converted in turn with no loss into mechanical 
energy. Unfortunately, however, we have not been able to find an 
electrochemical process for using our greatest source of energy — the 
oxidation of carbon — for the direct generation of electricity. We are 
forced to use the roundabout means of heat engines, with their notori- 
ously low efficiencies. In the light of such considerations, the action of 
animal muscle, which up to now has been quite mysterious, is all the 
more remarkable. The growing understanding of the nature of giant 
molecules (proteins belong to this category) offers the possibility of 
explaining the situation. It has made possible the description of a 
model for the process whereby chemical energy is converted into 
mechanical energy. 

If the molecular segments mentioned above are now considered to 
contain electric oharges, the interna} energy willno longer beindependent 
of h, and the distribution of lengths will be fundamentally changed. The 
presence of the electrostatic forces acts to favour the extended con- 
figurations in which the charges are as widely separated as possible. 
Such electrically-charged groups occur when the sodium salt of poly- 
acrylic acid is dissolved in water. The sodium ion Na+ dissociates from 
the COONa group to some extent; and instead of the COOH groups 
occurring in the acid, charged COO - groups are present. 

A complete computation yields the following results : For uncharged 
segments, the commonest value of h is about 0-08 of the fully-extended 
length of the molecule. With 20 per cent dissociation, on the other hand, 
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this increases to 0-7 of the maximum length. Suppose the molecule is in 
a solution whose concentration of H + ions (or of OH - ions) can be changed. 
If such a change occurs, the law of mass action will bring about a change 
in the degree of dissociation of the COOH group, and the molecule will 
thus be either shortened or extended — the former if there is an excess 
of H+. In the process, work can be performed by lifting a weight, the 
energy coming from the chemical energy of neutralization. This model 
of the muscular process is found to correspond with experience at many 
points. For example, it correctly predicts that each cubic centimetre of 
the dry substance should be able to lift about 70,000 times its own 
weight. 
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Some Functions Occurring in Connexion with the 
Wave Equation 

1. Generalization of x\ for any Value of x. 

In the following development we set ourselves the task of finding 
a function of x which shall represent the value of a factorial for any 
kind of x, including complex values, although this function was origin- 
ally defined only for x a positive integer. This function is called II (x ) ; 
it is often designated also as r(« + 1). It is characterized by the re- 
lation 

Yl(x) = xll(x- 1) (1) 

For positive, real values of x this condition is satisfied by the integral 

II (aj) — r(* -j- 1) = f a> t x e~ t dt', .... (2) 

Jo 

for, integrating by parts, 

+ x y . . (3) 

and this is equal to ®II(® — 1). Now 11(0) — 1, as may be seen by set- 
ting x equal to one in equation (1). Hence repeated application of (1) 
gives, for integral values of the variable, 

n(a:) = r(* -KI) =1.2. 3. 4... a: = 

The integral does not converge when x is negative or when it is 
complex with a negative real part. For this reason Gauss supplied a 
generally valid definition of the function II (*) in the form 

IK®) = lira — (4) 

W ^.(1 + *) ( 2 + as ) ( 3 + »)... (» + *)* W 

Obviously, from this definition, II (x) will be infinite for any integral, 
negative value of x. In order to show the identity of both definitions 
for positive x, consider the integral 

J,(x ) = J V+*(l — t) n ~‘dt. 

807 


(5) 
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When n — s this becomes 




Integrating (5) by parts, 


Repeated application of this recurrence formula yields 


J 0 =/ |»<1 -*)"* = 3 


(1 + x) (2 + x) . . . (n -f x) ‘ n + x -f- 1* 

whence 

( 1 +,)(»+?. - “*<” + * + ‘' X '"* 1 - «•* 

Setting t — u/n in the integral, 

»!»! = w + a + 1 f n ( x _ 

(1 + *) (2 + *)...(» + *) n J 0 \ n) 

According to the definition of e as the limit of [1 -{- (!/«)] " for n 
approaching infinity, we have in the limit 


=f\» e -u dU ' 


^ (1 + *) (2 + x) . . . (n + *) J 0 

whereby the agreement of the two definitions for positive values of x 
is proved. 

It follows from (4) that 
H(--a;) 11(3-1) 

ST 

The denominator, however, is merely the familiar infinite product 
expression for sin7 rx/n, whence 


( 3 ) 
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In particular, putting x = 1/2, 

1I(-|-) = yV- (9) 

The expression T(a:) is conventionally used to represent the 
logarithmic derivative of the II function. It follows from equation (1) 
that 1 

’F(*)=-+'F(® — 1) (10) 

Values of this function may be found in tables. According to the 
definition (4), 


* « - m - 1 - ih - rh — ■ irV.)- (U) 

In particular, 

! F(0! lim ( log, » - -J - l ••• ~ -0. (12) 

The negative of the limiting value of the parentheses is called the 
Euler-Mascheroni constant; its numerical value is 0-5772 .... If an 
integer m is substituted fox x in (11), the subtracted terms from 1 to 
m will be missing in (12) with the result that 

*<"»— c + ! + S + -S- * ■ ' < 1S > 


For negative integral values of x the function II (x) goes to infinity 
as Ijx when x goes to zero, and so its derivative tends to infinity as 
1 /a: 2 - Hence T(x)/n(®) will be finite, and its value may be deduced as 
follows: We have 


y(») _ 

n(x)'~ 


a n(x + 1) 


[x + 1] |V(®+ 1) - j— 1 

n<x + 1) 


from which we get, by repeated application, 

(® + 1) (® + 2) . . . (x + m) f T (x -f- m) — ----- 

t (x) _______ L g +1 

H(*) IT 


x+2 


■M. 


When x — — rn all terms of the numerator vanish except the one 
multiplied by l/(cc -f- m). Thus, on account of the fact that 11(0) == 1, 


'>■■(- >n) 


(~1)"II(>» - 1). 


(14) 
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2. Bessel Functions of the First Bund. 


(a) The Bessel differential equation 


The wave equation 


A u 


l&u 

<?dt* 


( 15 ) 


is transformed in the usual way, by substitution of a periodic solution 


« = v{x, y, z)e*"‘ 
into the equation A v + Wv = 0, 


(16) 

(17) 


where k = to/c = 2w/A. We now investigate the case of cylindrical 
waves, whioh are independent of the z co-ordinate. Then, according 
to Ex. 22, p. 43, the relation between the two remaining co-ordinates 
p and <f> will be 

d z v . ldv , 1 3 2 v . J2 „ ,, 0 , 

s?+- P r P +-?w + m, ~‘°- ' ■ ' m 


Assume the solution v(p, <f>) to have the form of a product of R(p) 
and <£(</>). In general, only those solutions which are single-valued 
functions of position are physically valid, so O must be periodic with 
a period of 2 t r. Therefore we set 

<I>(^) = e 2»4* (19) 

where generally, in physical problems, p is an integer. In some cases, 
however, non-integral values of p come into consideration, so that in 
what follows we shall assume p to be an arbitrary positive or negative 
number. Writing for convenience y(x) in place of R(kp), we get the 
equation 

* 2 S + *S +( ^~^ )y==o - • * • (2o > 

This is the Bessd differential equation. Apart from certain special values 
of p, its solution is not expressible in terms of elementary functions. 


(6) Solution by power series 
We substitute the power series 

y = a x x^ -f- Og®*"* -f 03®*"* + . . . a r ® n, r + ... . (21) 

into the differential equation, getting 

— p 2 )^®*"* + (m^ — jj 2 )^®”** 4 - (m3 2 — p 2 )o3®*"* + . . . 

4- a 1 af tt ^ z 4- o 2 ® m * +2 4 - 4- • • • = 0. 
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In order that the coefficients of each power vanish we must have 


m i = m 1 + 2, ) 
m 3 = m a + 2.) 


(22) 


Since the term with the smallest exponent appears only once, 

= p 2 , 

»h = ±P (23) 


Let us examine first the solution -\~p. From (22), 

m a = p + 2, wig =p + 2.2, m r — pf-(r — 1)2. 

We then obtain the coefficients 

n a i £i °i 

2 m a 2 — p 2 p* + 4p + 4-p 2 2 2 (p.+ l)’ 

°3 = ~ p* = “ 2 2 (p +*2) . 2 = 2* . 2(p + 1)(? + 2)’ 


= _ fl 3 = = «1 

m \ — P 2 , 2 2 (p + 3) .3 2®.2.3(p + l)(p + 2)(p + 3)- 

From this we generalize to 
a k 

Uk+1 ~~ 2 **(p + A) 

(-D*«l „ (~l)*n (p)O! /2 

~ 2 2 *II(*)(p + l)(p + 2) . . . (p + k) - 2 2 *IT (A;) n (p + ty { v 


The coefficient a, is arbitrary, smce a multiplicative constant may be 
inserted into any homogeneous linear differential equation The 
normalization, of what are called Bessel functions (sometimes cylinder 
functions) of the first kind is attained by setting — l/2*’II(p). The 
result is 

7 _ ** _ 2:2 4 * 

9 2»n(p)L 2 2 (p + l) + 2*n(2)(p + l)(p + 2) 


2®n(3)(p + l)(p + 2)(p + 3) 


+•••] 


“*7 0 n(*)n(p + *j 


or- 


(25) 


Comparison with sin a: and cosa; shows that the convergence of these 
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series is excellent. The character of J 0 and of J 1 is to be seen from 
fig. 1. 

In the above derivation there is nothing which restricts the order 
of the Bessel functions to whole numbers. Even for non-integral values 
of p we are able to find at once a second independent solution which 
supplies the second constant *of integration (the general solution of the 
second-order equation (20) must contain two independent constants). 
Examining now the value m x = —p, we get an entirely different power 
series. For instance, forp = 1/3, the exponents 1/3, 7/3, 13/3 . . . occur, 



while for p = — 1/3 we get — 1/3, 5/3, 11/3 .... Thus for non-integral 
p, the general solution of the Bessel equation is given by 

y^AJ^ + BJ^ . (26) 


The power series behaves quite differently for negative integral 
values — n of p. Since II(— n) is zero according to equation (4), p. 807, 
there is no contribution to the sum until k — n. In order to begin 
the summation with zero again, we set k = l + w and obtain the 



/_„ = s 

i-o 


(-1)*+" / a5\ n+ai 

n(i+«)n(i)W 


• (27) 


But this is identical with (— l) n J„. Hence for integral values n of the 
older number p, = (_I). J, (28) 


In the important instance, then, where p is integral, m = — p does not 
supply a second independent solution. This solution must be de- 
termined separately (see § 3, below). 
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(c) Relations between the Bessel functions and thevr derivatives 

The Bessel functions have certain similarities with the trigonometric 
functions ; in particular, J 0 (x) is comparable with cos®, Jjfx) with sin*. 
Like the Bessel functions, the trigonometric functions have infinitely 
many zeroes. This similarity also exhibits , itself in the relations between 
the derivatives of the functions and the functions themselves, com- 
parable with the relation d (sin x)jdx — cos*. 

Let us show, first, that 

«V(») i(*)-«W*)] (29) 

By differentiation of the series expression for J v we get 
vK) t kton(k)n(p + k)\2) 

Writing the factor p + 2k in two parts, p + k and k, we can obtain 
two summations. The first yields the series 


i v (y + &)(--i) fc /*y + 2t - 1 

VonwiKp + zow 


/ x \p+2k-l 


The series formed with the second factor will be 




( 


Since l/II (—1) = 0, the term corresponding to k — 0 will vanish and 
the denumeration will really begin with k — 1. In order to start 
counting with zero, we put k — l + 1 and obtain 


/ x \p+2l+l 

i-o nWn(p-M + i) \s) = * +1 K * h 

which proves the relation (29). In particular, since J_ x (*) = — t7 x (*), 

Weg6t Jo (*)=**! (4 (29a) 

In the same way, we find that 

J»' (*) = -? J, (*) + Jp-x (*) (30) 


From (29) and (30) it follows that 


i JAx) = J*-iW + Jp + t(*). 


(31) 
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A variety of further relations may be deduced from the recurrence 
formulae (29) to (31), but none of these will be given here. An impor- 
tant integral relation, however, will be derived. Multiplication of (30) 
by x* yields 

x v Jf ( x ) -f px v ~ 1 J v (x) = a (x). 

The left side of the equation is the exact derivative of x T J v (x), so that 
the result of integration is 

J vPJv-i, (x)dx — x v J v (x) (32) 

(d) Representation of Bessel functions of integral order by 
definite integrals 

It will be shown that for integer values n of p, the fundamental 
differential equation (20), p. 810, is satisfied by the function 

— - f cos(xsin^ — n<f>)d>j>. . . . (33) 

rr Jo 

Differentiating, 

~ ~~ \ f — nf>)Binf> df>; 

^'dx?~ ~ — ~ f” 008 ( x — n<f>)am z <f> df>. 

Substituting in the differential equation, we must have 

J cob (a? sin^ — n<ft)sm 2 f>d<ft + sin(* sin^ — n<f>)sm<j> d<f> 

+ / 0 (^ _1 

To prove the validity of this equation, transform the second integral 
by integrating by parts, obtaining 

J sin(x sin <f> — ruf>)Bm<f> d<f> = — £sin(x sin^ — n<f>)co&<f > j 

4 -f * oos ft oos {x Bin <f> — n<f>)(x cos<f> — n)d<f>. 

Combining with other integrals, we get simply 

— ~j”cos(xsm<j> — ntf>)(xcoa^> — n)eUf> = — — jjrin(x sin<£ — n<£)J 

which turns out to be zero, thus proving that the function (33) satis- 
fies the Bessel equation. 


^cos (x sin <f> — 7uf>)d<f> = 0. 
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(e) Orthogonality of the Bessd functions 

The computation of the coefficients of Fourier series is made possible 
by the vanishing of certain integrals of trigonometric functions. The 
Bessel functions have similar properties, so that Beries developments 
can be set up in t§rms of Bessel functions instead of in terms of trigo- 
nometric functions. This may be shown by starting with the following 
differential equations: 

+ (a 2 ®* — p*)yi = 0, 

+ xyf -f (p*x 2 — p 2 )y t = 0. 

By making the substitutions z — ax, z = fix, it is easy to see that the 
solution of the first equation is y x = J„ (ax), and that of the second is 
y s = J* (fix). Multiply the first equation by yjx, the second by yjx, 
and subtract: 

*(yi"y« - Wvi) + (viVz - yM + («• - = o. 

Multiply by dx and integrate from 0 to 1. Then, since the first tvo 
parentheses taken together are exactly equal to the differential of 
®(yi'y 8 — yfy i), we have 

(« a - y*dx = —\x(yfy 2 - yfyj\^ 

Substituting the Bessel functions in place of y and remembering that 
yf — aJJ ( ax ), there results 

( a a — /3 s ) y xJ„ (ax)J „ (fix)dx 

- (ax)Jf(Px) - a J» (px)Jf(ax)}]\ 

If we exclude functions of fractional, negative orders, p < — 1, which 
become infinite of too high an order when x = 0, the expression in 
brackets will vanish at the lower limit and we get 

If, now, a and /8 are two different zeroes of the Bessel function, we have 
the first orthogonality equation, 

J xJ 9 ( ax)J p (f}x)dx — 0. 
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The expression becomes indeterminate when a = fi. Assuming jB 
to differ from a by a small quantity which is allowed to approach zero, 
the value for o = j8 may be obtained by differentiating numerator and 
denominator with respect to /?, and then setting jS — a. The result is 

Using the fact that J„( a) = 0, we get the second orthogonality equation 

f'zJJ (ax)dx = a) (35) 

ro 


3. Bessel Functions of the Second and Third Kinds. Asymptotic Values 
of Bessel Functions. 

(o) Expansion of Bessel functions of the second kind in scries 

When integers are substituted for p in equation (25) no new function 
is obtained. To find this result, we pass to the limit by letting an 
order number which differs slightly from a whole number approach 
the integer value. Consider the e 


N v (*) 


(a?) gos pir — J_ v (x) 
sinpn * 


(36) 


which certainly represents a solution of the Bessel equation. When p 
is an integer, both numerator and denominator vanish, and the limit- 
ing value is obtained by differentiating numerator and denominator. 
The result is 


N n (x) = - 


—w sinprrJ p (x) + CQSp-rr 


dJ„(x) 


dj_ 9 (x) 
dp 


77 COSJ377 




u dj p {x) _ dJ - v {xY\ 

. ' h dp Jp-n' 


( 37 ) 


Using the notation of § 1, we get, further, 

(-D 1 


,.. 0 n(k)Il(n + k) 



nwno 
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Multiplying bv (—1)“ and subtracting, we have, by (14), 


v2V,.i» = 2 J n (x) log, | 


_ v (~l)*y(n + *) /a;\"+“ 
s-o n(&)n(n + *) \2/ 


I (-iy+«Y(-n + ft) 

*_ 0 n(fc)n(-w + yt) \2/ 


(38) 


As long as & ft, the same powers of a: occur in the first summation 
in connexion with n 4- fc as do in the second one in connexion with k. 
The coefficients are '¥(k)jYl{k -j- n)7l(k), and if these are combined 
with the first summation, all that remains of the second one is the 
part from k = 0 to k~ n — 1. Here the coefficients are [cf. equation 
(14) of § 1]: 

T(— n + A) n „_&n(» — k — 1 ) 
n(i)n (-»'+*) 1 ' n(*) 

.ruwiy, 

1 IT (n — * — 1) 
rr o TT(*j 

It follows from this development that the Bessel functions of the 
second kind (also called Neumann functions) become infinite as x~ n 
when x approaches zero. For N 0 the sum involving negative powers 
drops out and the first term makes the entire function approach an 
infinite value logarithmically (see fig. 2). 


F (n + *) 


n + *) 
0 " 


r 


(39) 


(b) Bessel functions of the third, kind (Hankel functions) 

Bessel functions of the third kind are defined by 
//^Tz)-J p (s) + ^(z),| 
E/V{x)^J v (x)-iN v {x).\ 

As will be shown immediately, these functions are related to J„ and 
N, in the same way that e ix is related to since and cosce. 


(e) Asymptotic forms of the Bessel functions 

The differential equation (20) may be reduced to a simpler form 
by setting 


y = 


(41) 
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and determining the exponent m in such way as to make the term 
containing the first derivative of z vanish. As may he readily verified, 
this occurs when m — — 1/2, leaving the following equation for z: 

g + ,( 1 _^ i )-°. . ... (42) 

Obviously when p — +1/2 this becomes the differential equation of 
simple harmonic vibration and J 1/2 as well as J_ 1/2 must be of the form 
(A sins Bgo&x)!'*/x. Comparison of the series expressions yields 
the correct values of the constants: 




■ • (43) 


However, for any value of p we again get the harmonic equation if x is 
made large enough. The asymptotic values of the Bessel functions 
become, after determining the constants, 
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'•"«**/(=) 

That is, 

'•""M-i—C't 1 )] 


for x large. 


for x large. 


This leads to a way of representing cylindrical waves. If, in accordance 
with equation (16), p. 810, we set 

u — J (kp)e iwt , 


all particles will vibrate in the same phase and we have a standing 
wave. A progressive wave will be characterized by the use of <o{t — xjv) 
as the argument of the periodic function. As the asymptotic form 
shows, this is accomplished by the Kankel functions /f„ (2) in conjunction 
with e‘“h 

Ex. 135. Represent Jg /2 by means of elementary functions. 


4. Spherical Harmonics. 


(a) Differential equation of spherical harmonics 

The Laplacian potential equation Au — 0 leads at once to another 
class of functions. We seek such solutions as are homogeneous and of 
degree n in the co-ordinates x, y, z. These solutions, expressed in 
three-dimensional polar co-ordinates, are of the form 

u = r n F n (0, <f>) (44) 


We write the potential equation in polar form and substitute (44). 
Then, according to p. 268, the equation for F n becomes 


1 

sin# 




1 3«i , 


(45) 


Inasmuch as the equation remains unchanged if — (to -f- 1) replaces n, 
r < n +i)Jt n is also a solution if r n F n is a solution. 

Assuming independence of <f>, a simpler equation results by setting 
cos# — x and F„ — y: 


( 46 ) 
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(6) Legendre polynomials 

As in the case of Bessel functions, we can find a power-series solu- 
tion for the differential equation. It is found that for one of the two 
solutions the power series terminates, i.e. the solution is given by a 
polynomial. However, a more vivid way of arriving at our goal is the 
following: Consider a unit electrostatic charge located on the z axis 
at unit distance from the origin. At a point P whose distance from the 
charge is r', the electrostatic potential will be 

u = - = (1 + r 2 - 2r cos 6)~ il2 = [I + r(r - 2a:)]- 1 ' 2 . (47) 


If r is small, the radical may be expanded by the binomial theorem, 
yielding a series in powers of r. Since each term is a solution of the 
Laplace equation, the coefficients will be the Legendre polynomials (or 
surface zonal harmonics) P n that we seek. Denoting the largest integer 
winch is less than or equal to m/2 by (m/2], the general expression for 
these functions may be written 


[«/2j 

P n (x) = S (-1)* 


1.3.5... (2» — 2k — 1) n _ 2s 
2*£!(m — 2/fc)! "* 


(48) 


The polynomials for n = 0 to m = 5, inclusive, were given on p. 699 
Since x — cob 8, we obtain F„ in powers of cos#. Using the well-known 
formulas, these functions may be expressed in terms of multiples of the 
angle. For example, for P a , 

P t (x) = £ (3a: 2 - 1) = i(3 cos 28 + 1). 


(c) Relations between spherical harmonics of various orders 

The development of (47), arranged in ascending powers of r, has 
the form 

(1 _ 2 rx + r 2 )- 1 ' 2 = P 0 (x) + rPfx) + r*P 2 (x) f . . . r"P n {x) 

+ r n+1 (P n+ i(») + 

Differentiating throughout with respect to r, 

(1 — 2 rx + r 2 ) -3 ' 2 (x - r) = P x {x) + 2 rP a (sc) + . . . nr^PJx) 

+ (* + l)r"P„ +1 (a:) + . . . 

or 

(1-2 rx + r 2 )~ 1/2 (x ~r) = (P x {x) + 2rP 2 (x) + , . .)(! - 2 re -f r 2 ). 
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Developing the left side once more according to powers of r, 

(x - r)(P 0 (x) + rP x (x) + . . . r n P „ (x) -f r»+ 1 P B+1 (z) + . . .) 

- (Pi&) + 2fP a (a>) + . . .) (1 - 2rx + r 2 ). 
Comparison of the same powers of r yields the relation 

(2» + l)aP.(«) = (» + l)P» + i(*) + nP^ix). . (49) 
In the same way we obtain, by differentiating with respect to x, 

(2n + l)P„(x) = P 1 n+1 (x)-P 1 „_ 1 (x). ... (60) 


(d) The assodaied-iharmonics 

We now take up the consideration of a single-valued dependence of 
F„ on the azimuth <f>, setting 

F n (6, 4>) = e^f n (6) (61) 

We then obtain as the differential equation for /„ 

Taking /„ (cos 6) = y(x), this gives 

In order to simplify this relation, we set y = (1 — x?) v u m and deter- 
mine p in such a way as to make the coefficient of u m a constant. 
The result is p = $m, y = (1 - sty* U m> . . . (64) 

and the differential equation for u m becomes 
(1 - a?) ^5 - 2(m + 1)* [«(» + 1) - m{m + !)]«„ = 0. 

(66) 

Differentiating, 

(1 - * 2 ) ^ - 2(*n + 2)® ^ + W» + 1) - (« + 1)(» + 2)] = 0. 

( 66 ) 


Thus if u m satisfies (55) for m, then du m /dx satisfies (55) for m -f- 1. 

For m = 0, u 0 becomes identical With P B> and so d m P n (x)jdX m 
satisfies (55) for m = m, and the solution of (53) for m = m is 


Pn m (x) = (1 -X 2 )” 


, 2 d m P n (x) 
dx™. 


(57) 
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These functions are called the associated Legendre polynomials or 
associated harmonics. Since the nth derivative of a polynomial of nth 
degree is a constant, we have 

P„”( x) = C( 1 — z 2 )”*' 2 = C sin "0 
and P n m (x) = 0 for m > n. 

If (49) is differentiated m times, the result is 
w d”P n (x) d^P n (x) 


d m P n+1 (x) d m P n _ 1 (x) 


Now if (50) is differentiated (to — 1) times, we get 

i% . , n d m ~ 1 P n (x) _ d m P n+x {x) _ d m P n -i(x) 


Multiplying the last equation by to and subtracting the result from the 
preceding equation, we get, after multiplying by (1 — re 2 )" 1 ' 2 . 

(2n + 1 .)*P,-(*) = (» - to + 1)P” x (x) + (n + m)P^ 1 (x). (58) 

(e) Orthogonality of the Legendre polynomials 

Consider two Legendre polynomials having the same upper index 
but different lower indices n and n' and satisfying the differential equa- 

i [ (1 ~ "> “£-] + [»(»+»- r?s] p ’w - °- 

s [<■ - **> + [*'<"' + « - = »• 

Multiply the first of these by P„™, the second by P„ m , and subtract 
the second from the first. Integrating the result between the limits —1 
and +1, we get 

[n(n -f 1) — n’(n' + 1)]/ P n m (x)P n - m (x)dx = 
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Integrating by fllfiSs, the first integral on the right becomes 

The bracketed expression vanishes at both limits, while the integral 
expression cancels the corresponding one that is obtained in a similar 
way from the second integral in (59). Hence 

J P n m {x)P n . m (x)dx = 0 for n' 4= n. » . (60) 


The computation of J [P„ m (a :)] 2 <& ; is somewhat more trouble- 
some, and so we content ourselves with quoting the result: 

(n + *»)1 


/ x iP» m W] 2 < 


2n + 1 (n — m)\' 


(61) 


As in wave mechanics, the proper functions — mostly Legendre poly- 
nomials — are assumed to be “ normalized to unity ”, the values given 
in tables must be multiplied by the reciprocal of the square root of the 
preceding expression if numerical values are of concern in the problem. 


(/) Application to the wave-mecluinical calculation of the selection 
rules for the magnetic and azimuthal quantum numbers 

In the older quantum theory, the selection rules were deduced by 
applying the correspondence principle to the atom model. In wave 
mechanics, these rules are consequences of the orthogonality properties 
of spherical harmonics. Imagine a unique direction chosen in an 
arbitrarily weak magnetic field, and let this direction be the z axis. 
Neglecting normalization, two proper functions of hydrogen may be 
written in the form (p. 702) 

tjj t = R i (r)P li m <( cob6) cos m ( <f> (i = 1 or 2). 

According to p. 705, the dipole moment governing the radiation ac- 
companying the transition from 1 to 2 will be 

= e Jr ■/qvMu 

where the integration is to be extended over the entire space. Since 
z — r cos 6, the z component of this moment may be written out in 
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the form of a product of three integrals: 

p, = ef rR 1 (r)R 2 (r)r 2 dr 
J o 

X J Pi™ 1 (cos 6) P u m ' (cos 6) coa9sinOd0 
0 

X / cos m,<£ cos»» 2 <f>d<f>. . . . (62) 

J o 

The last integral vanishes, except when = m 2 . This is the selec- 
tion rule applying to the magnetic quantum number for light vibrating 
parallel to the field direction. For — m % — m, the second integral, 
written in terms of the variable ®, becomes 

J 2 - f^xP li m {x)P,™(x)dx. .... (63) 
According to equation (58), this becomes 

(»)*• 

(64) 

The first integral is zero except when Z 2 = -f- 1 ; the second is zero 

except when Z a == Z x — 1. This is nothing more nor less than the selec- 
tion rule l — ► l + 1. 

(g) Legendre functions of the second kind 

For the sake of completeness it should be added that the Legendre 
polynomials, containing as they do only one multiplicative constant, 
represent only one of the pair of solutions of the differential equation. 


A second solution, for m = 0 and n — 0, is 

Q 0 (x) = tanh- 1 ®, (65) 

and for m — 0 and n = 1, 

Qi^^stanh- 1 ® — 1. .'.... (66) 

The succeeding Legendre functions of the second kind are obtained 
from the following recurrence formula, corresponding to equation 
(49), p. 821: 

(* + 1) Qn + 1 (*) = (2 » + 1 )xQ n (x) - nQ n _, (®). . (67) 


In the same way, the associated Legendre functions of the second kind 
are obtained from 


Qn m (z) = (1 - S 2 )’ 


dal™ 


( 68 ) 
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SOLUTIONS OP THE EXERCISES 

The more difficult exercises are marked with an asterisk. In such instances 
it is intended that the solution should be consulted as an aid in solving the problem. 

Ex. 1, p. 10. A + B + 0 = 0. 

Ex. 2, p. 10. (a) B = aA; (b) C = aA + 0B, where a and p are soalars. 

Ex. 3, p. 12. This is the trigonometric Law of Cosines. The alteration in sign 
of the term 2AB cos (A, B) is due to the fact that here oos(A, B) refers to the 
exterior angle of the triangle. 

Ex. 4, p. 12. (A + B) and (A — B) are the diagonals of the parallelogram 
formed by A and B. In the particular instanoe where A’ = B s there results the 
theorem that the diagonals of a rhombus are mutually perpendicular. 

Ex. 6, p. 12. Co»0 = s s s 2 = oostXj cosou + cos^ cos(3 s -f- cos -ft cosyj. 

Ex. 6, p. 15. Project the vectors B, C and (B + C) on the plane normal to 
A, in which the product vectors [AB], &c., lie. The product vectors are obtained 
from the projections by rotation through rc/2 and multiplication by | A | , since 
the magnitude of the projection of B is | B | sin(AB). Since the projection of the 
sum of vectors is equal to the sum of the projections, and since the altered parallelo- 
gram of the projections is similar to the original form, (16) is proved. 

Ex. 7, p. 15. A*1 B* sin* 6 + A‘B* cos 2 9 = A a B*. 

Ex. 8, p. 15: 

(а) Since the projection upon sofa radius vector drawn to any point of the 
plane must be equal to p, the equation of the plane must be 

m = p or *oos a + y oos{} + zcosy — p = 0. 

(б) p a = (r„ - r)n = r a n - p. 

(c) The vectors (r — (r t — rj and (r t — r x ) are to be coplanar, i.e. the 
parallelepiped formed by them must have zero volume, whence the equation 
of the plane is 

or, in terms of the co-ordinates, we get the equation of the plane: 

I * — *i y - V i * ~*i I 

y, - y 1 zj-z, 1 = 0. 

I y*-yi I 

Ex. 9, p. 17. If we set [AB] = E and apply (24) we obtain 

C • ED - D • EC = C • ABD - D ABC, 

i.e. a vector in the plane of 0 and D. Similarly, putting [CD] = F yields 
B • FA — A • FB = B ■ CDA — A • CDR, a vector in the plane AB. Hence the 
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product [[AB][ CD]] is a vector along the line of intersection of the planes of AB 
and CD; this is also evident geometrically. 

Ex. 10, p. 20. When A has a oonstant direction. 

Ex. 11, p. 20. Since <£dS = 0 for any closed surface, the integral over the 
curved surface is equal and opposite to the integral over the plane surface bounded 
by the curve. But this has the direction of the normal to the plane, which is also 
the direction of the binormals of the boundary curve. 

Ex. 12, p. 23. The level surfaces of the function r(x, y, z) are spheres, so that 
the gradient has the radial direction. Proceeding by dr in the direction normal 
to the sphere, the function changes by dr — d j r | — hence the magnitude of the 
gradient is unity, and grad r represents a unit vector in the direction of r. We 
designate it by r Q . 

Ex. 13, p. 23. grad / = f gradr = |r 0 - \%r. 

Ex. 14, p. 25. For a geometric derivation choose as volume element a portion 
of the spherical shell of radii r, r + dr, which subtends a solid angle dd at the 
centre. Since r is radial, it is normal to the spherical surfaces and the surface 
integral becomes 

=» dCl(r + dr) 3 — ddr 3 = 3 dQsPdr. 

Since the contained volume is d-u = dCir^dr, the limiting value becomes div*" = 3. 
According to the formula, this is obtained immediately from 
d? , Sy dz , 
dx " r ~dy ■*" dz 

Ex. 15, p. 25. The value of the exports is equivalent to the gold import; the 
value of goods imported is equal to the gold export. The “ sources ” of the out- 
wardly directed stream of gold are the places where imported goods are consumed, 
and a similar statement is true for the centres of production of exported wares, 
which are “ sinks ” for the corresponding flow of gold. The integral of the gold 
stream over the boundaries represents the balano© of trade; according to Gauss’s 
Theorem, this is equal to the change of the gold supply within the oountrv. By 
the term “ gold ” we mean simply any item of credit. These considerations do 
not include payments without recompense (interest, &c.). 

Ex. 16, p. 29. If radius vectors are drawn from 0 to points on a curve, 
ds = dr, i.e. irds — jdr^JZ = 0, so that curl r ~ 0. This results also by direot 
computation from dzfcy — 0, &o. 

Ex. 17, p. 37. t grad • t = ”, t grad • 6 = — t n. 

P 

Ex. 18, p. 37. If x, y, z are the co-ordinates of a point on an axis, 

4- a la y + Oia* = X* "j 

a n x + o 22 y + a 2S z = Xy l (1) 

a 31 X + + “ 38 * = Xz J 

These equations, homogeneous in x, y, z, are solvable only if the determinant 
vanishes; 

| Ojl — X ®m “is I 

®2i ®*2 “ X a a = 0 (2) 

I Oji a 82 033 — X I 
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This so-called secular equation is of third degree in X. If one root is known, two 
of the equations (1) then give values of y/x and z/x, i.e. the directions of the prin- 
cipal axes. If we imagine the co-ordinate system placed along the principal 
axes, X will give the ratio of .the magnitudes of the veotors and r for the axial 
directions. Since this is true for any co-ordinate system, the coefficients of the 
secular equation must be independent of the co-ordinates. Sinoe, in the system 
of the principal axes, they contain only the a i( , which we denote by a z , on, 
Oxn, we have 

o u + a M + Ugj = Oj + On + o ra , 

a n®2s -h a 2s. a 83 + ®33°ii — (®i3* + “is a + “as*) = ®i°n + o n a ra -f a DI Oi, 

OhOjjOjj -f- o l2 «23a 3 i + a i3*2i°32 — (^n^aa 1 + «*22®is* + <*33 a i» a ) =* a i“u“ra- 
An application of this tensor invariant appears on p. 176. 

* Ex. 19, p. 40. 

(a) For a finite volume, integration of equation (58) (p. 31) gives 
fudS — J grad u dr. But grad ir = i, hence |t> • dS = J ids = iV. 

(b) We determine first the projection upon i of the vector w, which we seek, 
by soalar multiplication by 

iv = §ir- id8 = i§ir-dS = i*V. 

Divide the volume into cylinders parallel to *. The lateral surfaces contribute 
nothing, sinoe id8 = 0, while the bases together yield r • idS, a vector parallel 
to i. The resultant vector is therefore parallel to i, hence v — iV. 

Ex. 20, p. 40. A ^ = div grad i = div — ~ gradr — ~ divr. 

Now gradr = r/r, div r =3, so that A — 0. 

Ex. 21, p. 40. Cf. p. 270. 


Ex. 22, p. 43. Cylindrical co-ordinates: 

!g»d i), - |i. (gnd ^ , (g^dtl, _ ! ?-| 




1 8A+ 
P ®<l» 


8A P 8A, 

8z dp 


(curl A)* 
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Spatial polar co-ordinates: 


div A “ dr (r * Ar) + rsro 56 (8in 9Ae) 

I 8A4 
r sin 0 0<j> 

<eu,I A), - JL- (M? _ | <«m 9A 4) ) 

M-Ef + |9+i3+i-'3 


8J. 

5(j> 


4- 1 ^ 

£4. 22a, p. 46. The truth of (as) is already contained in the solution of Ex. 18. 
Here, in order to show the application for arbitrarily many dimensions, we call 
the co-ordinates x,, x 2 , x 3 , . . . , x„ instead of x, y, z. In this notation, the system 
of equations (1 ) appearing in the solution to Ex. 18, corresponding to the root X<*’ 
of the secular equation, becomes 

s«aft w = ^ h W A) , »' = 1. 2, . . ». . . . (to) 


Likewise, for any other root X (i >, 

■= i = 1, 2, . . . n. . (2o) 


Multiply the first equation by and the second by x/ Ai , perform a sumroatio: 
on each and then take the difference: 

sy/.'v/’ - = (* (ft) ~ . (3i 


But A is assumed to be a symmetric matrix (a ik — a^), so that the double sums 
on the left are equal and the components of the “ characteristic veotors ” * which 
correspond to various roots of the secular equation have the property that their 
scalar product vanishes. They are mutually orthogonal in ^-dimensional space. 

Inasmuch as the equations (1) are homogeneous, they tell us only the ratios 
of the x^ h \ e.g. the directions of the vectors. Their magnitudes are still at our 
disposal, and we may set them equal to unity (“ normalize ” them). The x 1 ,h 


German: Kjen( vectors). 
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give the direction cosines of the co-ordinate system associated with A (in three 
dimensions, the axes of a second-degree surface) relative to an arbitrarily-chosen 
system. If these »* quantities are used to form a matrix in such manner that the 
components corresponding to a given X <A) are placed in one column, then this 
matrix S will be orthogonal and the matrix S -1 will be its transpose. If the com- 
ponents of any veotor r are Xj* in the characteristic system, then the components 
in any arbitrary system are obtained, according to p. 34, by taking 


Multiply r by matrix A, giving a transformed veotor 
r l = Ar = ASr* 

whose components are given in the arbitrary system. According to p. 34, to 
obtain from this the components in the eigensystem we must take 

r 1 * = S-Vi = Sr’ASr*. 

When this is calculated out, S -1 AS is found to be a diagonal matrix having X (l) , 
X (2) , ... as diagonal elements. This must be the oase, since in the eigensystem 
we have simply a, 1 * = X (l >x,*, xj* = X (2) x a *, etc. 


Ex. 23, p. 67. In the real form, /(<) = 1/2, 

to r to r in i a 

b n = - I 1 • cos tu at dt = 0, c„ = - J sin n cat di 

or c n = — — [cos not] 2n,a = — for n odd, 

irnto */» nn 

= 0 for n even. 

Ex. 24, p. 67. By equation (26) (p. 66) the coefficient A s for the interval from 
s to * + da is given by 

A, = J e~o*cos o>x e-* 3 *** dz, 

where the variable of integration is denoted by z, and f(x) is understood to be 0 
for negative values of x. This yields 

A,*=lJ o e-o d* + e-oxe-ioz-iMxtdx 

i , i 

a — t(o> — 2ns) ^ a -j- t(co + 2ns)' 

In practical cases a <C to, so that it is only when 2ns is in the neighbourhood 
°f ±<e that A , assumes values appreciably different from zero. The conjugate 
complex value is obtained for A^. As may be seen by combining the two forms 
and passing to real values, the amplitude of the vibration of frequency s turns 
out to be twioe the modulus of A, (or A_J. The square of the amplitude, which 



APPENDIX 


determines the intensity in the ease of light vibrations, has the following value 
when the second integral is neglected: 

j l 

* a* + 4w 1 (v — s) 2 ’ 

or for v = «, i 

and I vz = i for v - n/2 = ^. 

Ex. 25, p. 61. The extreme cases are a “ figure eight ” closed curve for S = 0 
and a parabolio arc for 5 =» jc/2. 

Ex. 20, p. 64. If the polar angle <j> does not enter, 
i £> « _ i a*u 
dr^^rdr c* dfl ' 


Setting u — v(r)e 1at , 


&v 1 dv <o* 
dr* + r dr c» 


0. 


If we put as 


rto/c, we obtain the differential equation of cylinder functions of 


<Py , 1 dv 
d&^xdz 


0. 


This equation has two distinct integrals, thb Bessel Function J 0 (sc) — J 0 (ra>/e), 
which remains finite at the origin, and the Neumann Function winch becomes 
logarithmically infinite there. The general solution is the sum of each multiplied 
by an arbitrary constant. Tables of these functions may be found in the Jahnke- 
Emde collection (see reference on p. 868). 

Ex. 27, p. 66. Integration of the differential equation (54) (p. 66) under the 
condition 1/V = kv yields the law of dispersion 


Vfcj(v*_C*)' 

Ex. 28, p. 71. Cf. p. 202, formula (32) et seq. 
Ex. 29, p. 74. 

fin dif dt j> 

Jo 1 + e cos <j> Jo , c, .. 


+ * 0 "* J ° l+±(e» + e-») + f (z + \) 


“J 3 * + "- 2+1 

Assuming e < 1, the roots of the denominator a 
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The point Zj lies -within the path of integration. Decomposing the denominator 
into partial fractions, 

* L__ f L L_], 

{Z - Z,){Z - Z 2 ) z 1 -z t \z-z 1 *-*,) 
and the coefficient of l/(z — zj, the residue, becomes 


ft* d<j> _ 2rtt • 2 _ 2rt 

Jo 1 + e cos <j> • 0 / 1 _ VI — i 


Ex. 30, p. 78. If F = (Fy, y'), the Euler-Lagrange equation gives 
d*F* , , 8*F dF n 

dy'iy 9 + 8y'^ 8y °‘ 


d ( ,dF\ dF n 

dx dy') dx~ 0, 
from which the result follows. 

Ex. 31, p. 86. From the equation of motion 


Let (dr/dt,) 0 = t> 0 for t — 0; then 


’ - - -~- a «dn*rf + («• + ~)‘ + r o- 


It is to be noticed that in general there is also a uniform translatory motion 
superposed upon the oscillation. 

Ex. 32, p. 85. Since F = —mgk, where h is the unit vector directed vertically 
upward, we have from 


the result that the motion is independent of the mass. Integrating o; 
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Thus the hodograph is a vertical straight line. Taking the origin at the position 
of the point when t = 0, a second integration gives 

r--|*fc + tv„. 

This shows that the orbit remains in the plane of k and i> 0 . Taking this to be 
the zz-plane and calling the angle between the initial velocity and the z-axis a, 
we have 

x — vj, cos a, 

* = o^sina — ifft 3 . 

Elimination of t shows that the orbit is the parabola 



Besides x = 0, there is a point a where the trajectory crosses the x-axis. This is 
given by 

tana = - — ~ or a = — - sin2a. 

2«„* oos* a g 

This so-called range of the projectile is a maximum for a = jt/4, and has the 
same value for a = w/4 + B as for a = jt/ 4 — p. 

Ex. 33, p. 87. Motion is communicated to the coin by means of friction, 
which depends in this instance only upon the normal pressure between the sur- 
faces. . The frictional force may be taken equal to gang where p is the coefficient 
of friction. Take the z-axis in the direction of motion, the origin being at the 
rim of the glass. Assume the rear edge of the card to be at this point also. Then, 
for the coin, ^ 

* = » + W g ' 

where a is the radius of the glass. If v is the velocity of the card, the rear edge 
of the card has reached the opposite rim after T = 2a/v seconds. If the coin is 
to fell into the glass, it must not move farther than x = 2a in this time, so that 



is the condition to be satisfied. 

Ex. 34, p. 87. No work is done, hence the feeling of fatigue must be due to 
other causes. The system is analogous to an electromagnet carrying a weight. 
No work is done — all the energy consumed appears as heat in the windings. The 
faot that energy is dissipated at all has nothing to do with supporting the weight, 
and if the windings had zero resistance no energy would have to be supplied. 

Ex. 36, p. 88. U = +10*1 2, since grad U = +krr 0 = hr. 

Ex. 36, p. 89. Take the fixed axis as the z-axis. Then 

F = p(r+XA) = mg. 

Multiplying veotorially by r, 
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The projection of the directed areas on the plane normal to the A-axis is ob- 
tained by taking the A -components of the auxiliary vectors. Scalar multiplication 
by A yields r 

° L 

which proves the theorem. 

Ex. 37, p. 89. If, in equation (9), p. 83, the derivatives with respect to i are 
replaced by those with respect to according to the rule 
d = d : = 2c d 
dt ■ <f<S> ^ r 2 ci<j»’ 

a simple computation leads to 

^--W+g- 

For r = p/{ 1 — c cos<j>) we obtain 1/r*. 

Ex. 38, p. 94. In the position of equilibrium x a we have 


For a displacement s, we have, to a first approximation, 


i.e. the total force is 


W 0 * X 0 ll J 


n being the mass of the particle. 


x„ *»»' 



(2) 

(3) 

(4) 


Ex. 39, p. 100. The maximum of the square of the amplitude involves the 
maximum of the function 


m 2 (w 0 2 - to 3 ) 4 4- £*co*’ 
which lies at to* = to 0 * — £S*/2m*. 

On the other hand, the kinetic energy at the origin is concerned with the 
maximum of the function 


m*(to„* - to*)* + (3*to* 

on account of the. derivative x. This maximum is at to = <o 0 . It is to be observed 
that the vibrating system is non-conservative, for the friction dissipates energy; 
for this reason the potential energy at the points of reversal — which is propor- 
tional to the square of the amplitude — is not equal to the kinetic energy at the 
origin. The mean energy is 

® \ «’ (“o’ + “*)• 


(e 711) 
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and this is a maximum if (neglecting small quantities) 

6> 0 * — = <*„*, damped. 

* Ex. 40, p. 103. If a quadratic term is added to the elastic restoring force —kx 
the vibration beoomes unsymmetric, sinoe changing the sign of x does not change 
that of the quadratic term, so that the force has different values at two points 
symmetric with respeot to the origin. The differential equation becomes 

m ^- ffeB + “ , = - p, » COS£ '> i * * W 


As a first approximation, we assume 

x 0 — a cos oii 4- 6 (2) 

The constant 6 allows for the dissymmetry. We insert this expression and de- 
termine a and 6 in such way as to make the constant term and the fundamental 
vibration satisfy the differential equation. Using cos*tc< = $(I -j- cos2«<), we 
obtain the two equations 

6* + *6 + }a' = 0 (3) 

and 

a (oi* - co 0 *) — 2 — 4* =■■ 0 (4) 

For s small, it follows from (3) that 


The other solution has e in the denominator and so is of no consequence, since 
we consider only small correction terms. Putting the above value into equation 
(4), we obtain a cubio equation quite similar to that for the symmetric case: 



This may be solved graphically in the same way, and under certain conditions has 
three real roots, so that the “ skip ** phenomenon occurs here also. For further 
details consult G. Duffing, Erzvmngene ScEudngungen bei veranderlkher Eigen - 
frequenz und Oure technische Bederulung, Sa m mlu n g Vieweg Nr. 41, 42, Braun- 
sohweig (1918). 


♦ Ex. 41, p. 107. Let z # be the height of the point of reversal above the lowest 
point. Then at any height » the conservation of energy gives 


(1) 

Vs g(*„-z). (V) 


We seek to determine * as a function of z. Putting s = /(z), we have ds/dt «= 
f'(z)dzjdl, and we obtain 


-L ~m= 

V2 g Vz 0 — z 


dz. 


( 2 ) 
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Integrating from 


0 


= z 0 corresponds to a quarter period, so that 


1 

Vo 


mdz 

Vz 0 - z 


(3) 


The integral in the right member should be independent of z 0 . The difficulty is 
that this parameter occurs not only in the integrand but also in the upper limit. 
To remove this difficulty we set 

* = y* (4) 

whence 


t = _L r 1 Vz of'(y z <>) *y _ 1 r 1 ^y^J'(^)dy 

4 WgJo vi -y v' 2gJ 0 ' / {l -y)y ‘ 


. (5) 


The result is independent of z 0 if 


Integration gives 


Vyz„ -/'(yzo) = V* -/'(z) = 

/(*) = « = 2a V z. 


• • (6) 
• • (7) 


This curve is a cycloid, as may be shown readily: by equation (6), 


^ = sin^ = i Vz; (8) 

therefore 

z = a* sin* <1> = (1— cos 2<j>); (9) 

■ ■ ■ ( 10 ) 

• -fw + toaM HD 


(9) and (H) are the familiar parametric equations of the cycloid. This method 
of solution is given by Marcolongo (see referenoe, p. 869). 

Ex. 42, p. 107 . Denoting the coefficient of friction by pt, the condition for 
equilibrium on the horizontal plane is m« 1 , /p 1 == \xmg, while on the cylindrical 
surface mg — p.mt),*/p,. Elimination of p, yields 

”= = “ V(p lPl ). 

Substituting the numbers, v t = 46 km./hr. 

Ex. 43, p. 110 . Since the motion is suoh that the common centre of gravity 
is at rest, we locate the origin there. Let m, be on the positive side, m, on tbie 


negative. At any instant we have 

»H*i + m^c 3 = 0 (I) 

and *, — *, = a + x (2) 

The equation of motion for to, is 

+ = , ..... ( 3 ) 
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Eliminating x i by using equation (1), we obtain 



If, now, we put 



p m l m a 

we obtain the ordinary equation of vibration 

dP p 1 OTj 

but with a constant term which gives the rest position of mj, 
integral is 

avW = = _J2l a . 

m z m 1 + >n s 

The general integral is 

x, — x a (°) + A Bin(o) 0 « — 8), <o 0 = 

Thus, in place of the mass m we have the “ reduced mass ” p. 


• • (4) 

■ • (5) 

. . (6) 
a particular 

• • (7) 

• • (8) 


Ex. 44, p. 113. Since the kinetic energy of rotation about an axis is given by 
JSwif d? <|> a , 

where d i is the distance of m { from the axis, the first quantity to be computed 
is the moment of inertia l = £m f d, a for the sphere. In spherical co-ordinates. 


d = r sin 6, so that , 

1 = p il £ = ^4- = nr> • • (i > 

Trot = (2) 

»“-**(*)■ <*> 


For rolling without slipping, we have the relation 

ds = rdif, (4) 

then, by the conservation of energy, 

T = (IM + \M) (g)* == Mgs sina (5) 

/dg\* 10 

and so = — gssma (6) 


If thu. body were to slide without rotation, the factor would be 2 in place of 10/7. 

Ex. 45, p. 113. Designating the velocities before impact by it, after impact 
by v, we have for elastic collision 

rojUj + mgttg — ntjVj 4- wtjUj (Conservation of Linear Momentum) (1) 
m,it, a 4- m z u, a = tnjV t a 4- m, « t * (Conservation of Energy) . (2) 



APPENDIX 


837 


from which it follows that 

_ (m t — wi 2 )a t 4- 2 _ (m 2 — + 2m 1 « 1 


For inelastic impact, the energy equation is 

m-jUi 4- m 2 u 2 2 = + m 2 vf + e 


»+2 

“l + n( 2 «2 + ^/(“i — “a) 2 ~ s ( TO + ~ ) 


1 + 




Ex. 40, p. 121. We have 

is 2 = E 2 (d0» + sin 2 6^), 

ence T = — ^ (i) 2 + sin 2 0 <j> 2 ). 


and the Lagrange equations for conservative forces become 

mE‘Q - mE 2 sin 6 cos 0 <j> 2 + = 0. 

mJP sin* 0 $ + 2 otjS 2 sin0 cos00$ + - 0. 

0<J> 

Fz. 47, p. 145. The line of action for each force and for the resultant B is in 
the vertical direction. The force polygon degenerates into a single vertical line 
of length E. From an arbitrary pole O draw lines to the ends of this line. These 
correspond to the segments F 0 .i and F^o in fig. 5, p. 145. Through any point on 
E draw lines parallel to Fo.i and Fi >0 , cutting the verticals drawn through the 
ends of the beam at points A and B. Through O draw a line parallel to AB cut- 
ting R in the ratio of the load forees. If there are several loads, first find their 
resultant and then prooeed as above. 

Ex. 48, p. 151 . On account of symmetry, the principal axes of inertia are 
xines through the centre of gravity parallel to the edges of the cube. The moment 
of inertia is the same about each of these axes, so that the ellipsoid of inertia is 
a sphere; hence the moment of inertia about any axis through the centre of 
gravity has the same value. This means that the moment of inertia which we 
seek is equal to that about the z-axis, or 

% + a/2 - + a/2 ,. + a/2 Ma 1 

^ = P / a/2 / a/2 + ^) dxd y dz = (f • 


Ex. 49, p. 151 . If I, is the moment of inertia about the parallel axis through 
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the centre of gravity, comparison of the period of the simple pendulum with that 
of the compound pendulum yields 

l = I, + Ms* 

(I = Mag 


This is a quadratic equation for a, for a fixed value of l. The sum of the roots is 
«! + «* = 1. 


Ex. 50, p. 151. The period of a oompound pendulum whose centre of gravity 
is at a distance a from the point of suspension is 


where / is the moment of inertia about the axis through the centre of gravity 
and M is the mass. Putting ljM = a, we have 


T_ 

2tt 




This expression vanishes when a/a * = 1, i.e. when 1 — Ms*. Comparing the 
value 

for this case with the formula for the simple pendulum 

we see that the equivalent length l must equal 2s. 


Ex. 51, p. 151. 

T mm *0*7 + pf «*, (1) 

I = o,*!, + a,*af„ (2) 


where a s and a, are the respective distances of M 1 and if, from the centre of 
gravity. Then 

°i — M~+~M = M^ ~ +M 


so that 


1 = (4) 

g. ~ if, + if,* 


U*. 52, p. 180, According to fig. 8, p. 166, the angle between k and h' is equal 
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TO (<j) — 8). Then, from the magnitudes of the product vectors of equation (54), 
p. 160, we have mga sin 8 = Pen sin(<|> — 8). Solved for 8, this is 


tan 8 


sin <j> 



* Ex. 53, p. 160. The deviation of projectiles is a very complicated problem. 
It is treated in detail by O. v. Eberhard in Auerbach- Hort, Handbitch der jihysik- 
alischen und technischen Mtchanik, Vol. 2. The principal effects are due to gyro- 
scopic action. For projectiles of usual form, the effect of the air resistance is to 
raise the front of the shell, i.e. there is a torque about a horizontal axis normal to 
the axis of the projectile. Since the axis of symmetry is the prindpal axis of 
inertia in this case, we see from the relation 


dP 

di 


W + [“ P] 


that the change in P, and hence the change in the position of the axis in spaoe, 
has the direction of the horizontal axis normal to the axis of the body. For 
right-handed spin, the deviation is toward the right when looking in the direction 
of advance of the projectile. At this stage the projectile, is inclined to the trajec- 
tory and the forces exerted on it by its motion through the air make it veer to 
the right. However, after half a processional cycle the body is inclined toward 
the left. It is often stated that the precession is so slow that the projectile has 
reached its goal before half a precession cycle is completed. This is not true. 
Actually the path of the tip of the projectile is not a oircle ; because of the changing 
inclination of its axis and the resulting shift in the point of application of the 
air resistance force, the point describes a type of cycloid which is always to the 
right of the tangent to the trajectory. The purely hydrodynamic Magnus Effect 
(p. 211) operates in the contrary direction, but is small compared with the gyro- 
scopic effect. 


Ex. 54, p. 167. 


Ar _ Vr"* — y/r* 
r ~ Vr 2 


^-'[■+ES+*<5 + S*+-]- 


Ex. 55, p. 179. The moment of inertia of the cross-section 

1 " Pi fP 3 * drd *~i < r « 4 - r * i) - 


The depression of the middle is then given by equation (65) (p. 178): 

, <?a 3 

12n {r* - Tf)E' 
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Ex. 56, p. 179. Take the origin at the uppermost point of the surface. At this 
point we have 

d*~~ ? (1) 

(9\ 

W~*‘~ p () 

— = — = z = 0 (3) 

dx By ( 

Integrating equations (1) and (2), and using equation (3) yields the equation of 
the surface: 

* = (4) 


The contour lines z = const, are hyperbolas whose asymptotes form an angle a 
with the a:-axis. This angle is given by 


tan a « = -. 


(5) 


Ex. 57, p. 180. Of an elementary hollow cylinder of radii r and r + dr, con- 
sider an element included between the polar angles 6 and 8 + dB. In the process 
of torsion, the upper face of this element is displaced a distance r<j> with respect 
to the lower face, which is at a distance l from it. Then, by p. 173, there will 
be a restoring force of amount d % F = SrdrdBr^/l, whose moment is (PM = 
rd*F = Sr*<j >drdB/l. Integrating over the entire ring, we have dM — 2nSr a dr^/l, 
and integrating over all layers from r = 0 to r = a yields M = (roSa*/2Z)<}> — T(j>. 
t is called the moment of torsion of the wire. 


Ex. 68, p. 190. Since z must vanish at the boundary at all times, the par- 
ticular integral is of the form „ 

z = c mn sin — sin e* ™«,n* (1) 


The differential equation is satisfied if 


(2) 


where p is .the density of the material of which the membrane is made. Equation 
(2) thus gives the possible natural frequencies. As in the case of the string, we 
write the solution, which must satisfy the initial conditions, as the sum (in this 
case, double sum) of particular integrals: 


z = 2> A mn sin — — sin eos27tv nin t 

+ 2> B mn sin — sin ~ sin2Ttv„ m «. 


The coefficients A mn are obtained by the method given in the text: 
A mn = jj^ x ’ y) sin IT “n ^ IT dx dy 


(3) 
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where /„( x, y) represents the initial form of the membrane, and f 0 '{x, y ) gives the 
initial distribution of velocities. 


Ex. 69, p. 194. The force acting on an element of surface d8 whose normal 
is directed outward is dF — —pd8. The minus sign is due to our convention 
regarding the sign of p. The resultant is 

F = — §pd8 = — J gradp dx (1) 

If pg is the vector gravitational force acting on 1 cm. 3 , we have by equation (la) 
(p. 192), 

F = - Jp*<*T= (2) 


Ex. 60, p. 197. Take the z-axis vertically downward. If the oross-section of 
the vessel is large and the orifice small, the flow may be considered stationary. 
Since the only velocity component is one in the direction of the z-axis, we have 
by equation (10a) (p. 196), ^ 



„dv _ n _1 dp 
dx 3 p dz ’ 

. . . . (1) 

whence, by integration. 

£ = pz-? + 0.- 

.... (2) 


According to our conditions, we may take v = 0 in the vessel itself and may 
assume that p is equal to the hydrostatio pressure p gz. Then 0 = 0. At z = A, 
i.e. at the orifice, p vanishes, so that 

v h = V2gh. 


This is Torricelli’s Theorem, according to which the fluid emerges with a velocity 
equal to that which it would attain by free fall through a distance h. 


Ex. 61, p. 197. Taking the z-axis in the direction of flow, we have 

n dv 1 dp 

dx p dx 


• ( 1 ) 


and 

Hence 

Integrating, 


P = Pi 
P Pi 

dv Pi 1 dp 

dx~ pi p dx 


B* 

2 


— — log p 4- c. 
Px 


(2) 

(3) 

(4) 


We may set « = 0 within the vessel; this determines 

C = — log j> x . 

Pi 


Then the velocity of efflux becomes 


V 

2 


(5) 


For another gas under the se 


e conditions of pressure, 

*£ = £log£- . . 


• • (6') 
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Henoe $ = p £ < 6) 

This relation is used to determine the density of a gas in the instrument known 
aa Bunsen’s Effusometer. 

Ex. 62, p. 213. For the oase of water the difference between adiabatic and 
isothermal compressibility may be ignored. We thus have 


P = p(l + xp) = p(l + a) (1) 

or gradp » ^ grad a, (2) 

whence, as in the text, we obtain the wave equation 



To obtain v in cm./seo. we must convert x to (dynes/cm.*) -1 . We have x = 
60/(1-013 X 10 la ), hence 

v = 1423 m./seo. 


The two compressibilities differ considerably for some liquids. For example, 
ethyl bromide has c t ,/c v — 1-87. 

* Ex. 63, p. 227. We start from equation (116) (p. 220). For the oase of 
rotational symmetry, this equation becomes, after putting <[> = gz, 




(the negative sign is to be taken for a concave surface). If z is measured from the 
surface of the liquid in a large vessel, X must be zero, since dz/dr and cPz/di* 
vanish. Remembering that the height of rise will be large compared with the 
dimensions of the meniscus, we may, approximately, take 


z = z 0 + X. 


( 2 ) 


and neglect £ in comparison with z 0 . Then 



Since dZ/dr =» 0 for r = 0, integration yields . 

dZ 



( 3 ) 


( 4 ) 
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The expression on the left is the value of cosfi for r = r 0 ; 
(cos 0 = 1), _ 2a 

■ * ‘ 


in the present case, then. 


(S) 


where z 0 is the mean height of the column. The same result is obtained more 
simply by noting that, since the liquid wets the tube completely, the surface 
film supports the weight of the ascended liquid. The total upward force at the 
film boundary is 2rrr 0 a, while the gravitational force on the liquid in the tube 
is Ttp r 0 a g. Setting these quantities equal, we obtain (6). 


Ex. 64, p. 227. The pressure is the same on both sides of a film, hence we may 
it p = 0 in equation (110) (p. 224). But the resulting equation, 


defines a family of minimal surfaces, i.e. surfaces having the smallest area for 
a given boundary curve. The rotational surfaoe of this fondly, which is sought 
here, is the catenoid — a surface obtained by rotating the catenary y = cosh x 
about the z-axia. 


* Ex. 66, p. 236, According to the results of p. 106, a pendulum swinging with 
small amplitude behaves like an oscillator with a “ quasi-elastic binding con- 
stant ” k = mg /l. By equation (18') (p. 234) the equations of motion are 


m 

and m — -f iy' = — 2mo> sin<j> ~ (2) 


which may be combined into one equation for the complex variable (z' -f iy'): 

5 + iy') + ^W + iy') - -2 im sin 4> ! (*' + iy'). (3) 


Introduce a system of co-ordinates which turns with respect to the x', y" system 
with the angular velocity sin<j> in a sense opposite to the rotation of the earth. 
Then we have 

z' + iy' = ft' + 

g (*' + W) = e~“»sinH ft' + f y) _ iv> 8in +(5 * + 

f lt 2 (*' + »V) = ^ ( 5 ' + ivj') - 2»<o sin^e-'«sin*« * ft' + ^') 


— <o a 8in ! ^»(5' 4- *'■»)') (4) 

Negleoting to 8 , 

~(5' + *V) + ^(5' + »Y) = o (6) 


Thus the ordinary pendulum equation obtains in the rotating system. This 
means that the plane of vibration of the pendulum, seen by an observer at rest 
with respect to the earth, appears to turn from east to west with the angular 
velocity a sin<[i. 



844 


APPENDIX 


Ex. 66, p. 235, If the fundamental equation (7), p. 110, is resolved into 
components along the system of principal axes of inertia fixed in the body, it 
must be remembered that 

dP d . P . r , dto d r to 

ri^' ad w = w- 

However, in this system P x - = Zi<oy, Py — jT n toy, IV = Ima/, so that 
M* = I, - (In ~ Im) oy or, 

My = - (/in - /i)«*' oy. 

M* =Im~^ - (/, - /n)oy toy. 

These are known as the Euler equations. If M = 0 they are satisfied if 
to*' = const., toy = to*' = 0, 
or toy = const?, to*' = toy = 0, 

or toy = const., toy = toy = 0; 

that- is, in the absence of any torque a rigid body can continue to rotate about 
one of the principal axes. Stability considerations show that rotation about 
either the axis of greatest or of least inertia will be stable, while that about the 
middle one will be unstable. 

Ex. 67, p. 258. The energy radiated in one year is 

4w X (150) a X 10 22 X 2 X 4-19 X 10 7 X 60 X 24 X 365 ergs; 

dividing by c 2 = 9 X 10 20 cm. 2 /sec. 2 , the decrease in mass is determined to be 
138 X 10 12 metric tons per year. 

Ex. 68, p. 269. No, for a field of this kind is not irrotational; cf. fig. 19, p. 26. 
Ex. 69, p. 269. 

a = -2-65 X 10~* e.s.u. = -8-83 X 1CT 14 coul./om. 2 
p = 1-32 x 10~* e.s.u. = 4-40 X 10~ 12 coul./om.* ' 

Ex. 70, p. 269. Graphical differentiation of the curve yields — E; a second 
differentiation furnishes the value of — 4rep. The differentiation is most simply 
performed with the aid of a mirror rule, which may be set very accurately per- 
pendicular to the curve simply by observing when the reflection is a smooth 
continuation of the curve itself. The angle which the rule makes with the y-axis 
is equal to the slope of the tangent. 

Ex. 71, p. 276. If the two plates are a distance d apart and if they have a 
difference of potential F, the uniform field between them will be of strength 
Vfd. Apart from the slight non-uniformity at the edges, the external field is 
everywhere zero. This may be looked upon as the mutual cancellation of the 
oppositely directed lines of force from the two plates. Between the plates, on 
the other hand, the number of lines is doubled. According to the footnote on 
p. 266, the force exerted by one charged plate on the other is thus given by 
F = Ve/2d, and since 0 = e/F = S/4raZ, we have F = 2ne*/S or F — F 2 S/8rrd 2 . 
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the 


72, p. 276. Since the situation is independent of the polar angle and of 
3-ordinate, we have, in cylindrical co-ordinates, 


1 dV 
r dr 


0 . 


(1) 


The solution is V ~ A logr + B, (2) 

whence V a = A logo + B, V„ — A log 6 + B, (3) 

from which it follows for V b = 0 that 


y = v 

“iogf 

The field is then given by 

log | r r log h - a 

and the charge e pet unit length by 

^>EdS = 47te = Lti**. 

alog- 

Thus the capacity is 

c = — 

2 log£ 


(4) 


(5) 


( 6 ) 


The solution for the parallel- plate condenser is 

V = Ax + B, E = ^, hence C = .... (7) 

The distortion of the field at the edges is neglected. 


Ex. 73, p. 276. C ril the radius of the sphere a and let the image point — whose 
charge is — e '— be a distance R' from the centre of the sphere. Then 


V 


(1) 


The value of V must be zero on the sphere, i.e. 

e' _ r' 


(2) 


Introducing the angle 6 between a radius of the sphere and the line joining the 
external charge and the centre, we have 


e' a _ r 7 * _ R" 1 + a 1 - 2c R' cosG 
e* ~ r a + « a - 2aR cos© 


r ,R’ + p - 2a cos 6 
^ E + ^ — 2a cos 0 


• (3) 
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This relation must hold for every angle 0; this is true if 


s'* 


( 4 ) 


which determines the position and oharge of the image. 

* Ex. 74, p. 270. Taking the origin of co-ordinates at the point under con- 
sideration, the power series development of 7 at this point is 

V — V o + + <ht) -f <*s* + ®11** + Otty* + <*ss** 

+ 2 a u xy 4- 2a 23 yz + 2a n xc + ... . . . . . (1) 

The produot terms may always be made to vanish by rotating the axes; in this 
system we have 

V = P„ 4- Kx 4- b# 4- b# 4- b n 3* 4- w + + (2) 

The components of field strength are given by 

• • * 

These all vanish at the singular point. In place of the equation of the tangent 
plane at an ordinary point, we have here the equation of the tangent cone 

hP + b t rf 4- b a V = 0, 

where 5, rj, K are the running co-ordinates. The nature of the singularity depends 
upon whether the cone is real or imaginary. Now the potential equation 
A7 = 2(6 U 4 -b a + 6 as ) = 0, i.e. 6 M = -(6 U 4- &„) 

always obtains. This guarantees the reality of the cone and the only possible 
singularity is of the nature of a conical point (corresponding to a real double 
point in plane geometry). For the case of rotational symmetry the angle of this 
cone is determined. If 6 U «= b u , we have — — 26 u , and the half-angle of 
the cone is given by 

tan a =» V2. 


This condition must be satisfied for the tangents to the potential curves in the 
meridional section at the “ abario point ”, e.g. between the earth and the moon 
or between two point charges. 

Ex. 76, p. 284. The case of a uniformly polarized sphere was treated in the 
text. The result was an additional field — 47tP/3 due to the free charges on the 
surface of the sphere. In the present instance the homogeneous polarization is 
present outside the sphere, hence div P has the opposite sign, and so the free 
surface charges are equal and opposite to those arising in the case of the uniformly 
polarized sphere. Thus their field is given by 

E--+1?. 


Ex. 76, p. 284. At any point, D r = 


a<r£b. E r 

E/ 


e/r*; moreover, for 
1 e 

1 e 

Kit*’’ 


c. 
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Ex. 77, p. 284. We givq the image a charge — e'. For the external space we 
may set 



without contradicting the relation ^EdS = 4we, since e' is not in this region, 
and a closed surface in this space does not enclose the charge — e', whioh is really 
non-existent. But within the dielectrio we must put 



The quantities e' and e" are at our disposal for satisfying the boundary con- 
ditions. 

The prescribed discontinuity of the normal component yields e + e' — e" = 0. 
The continuity of the tangential component is certainly satisfied if Fj = 7, 
at all points on the surface of separation, for if we form the derivative dV/ds 
along this surface, it will have the same value on both sides if F, = F,. This 
relation furnishes the second equation connecting e' and e"j 


whence 




K - 1 
e K+ 1 ‘ 


2e 


K 

r+r 


Ex. 78, p. 286. The force acting on the charge — e is — eE, that on the charge 
■+• e is e(E + da grad • E); henoe the force acting on the dipole is 
F = m grad • E. 


Ex. 79, p. 290. The density of surface charge is 

q = ± = 

8 8 

and the energy is 

V = jJ(F, - V t )edS = i C(F, - F*)» 

or, denoting the potential difference by F, 

f7 = £CF*. 

If the charge is held constant, F' = V/K, C — KG and U‘ = U/K, so that 

V — V = J (■- — l) OF* = A V\ for a parallel-plate condenser. 

Now (1 - K)/4r. = -x, V/d = | E 0 1, V/Kd = | E |, Sd = t. in conformity with 
(40), is proved. If the potential difference is held constant, V — KU. 
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Ex. 80, p. 290. The value of the potential inside the sphere as given by 
equ. (25), p.272, must be inserted in (31'), P- 287, and integrated from 0 to o. 
One then obtains lR^2 n 2«5 « „ s 


where e is the total charge. 


Ex. 81, p. 292. Call the distance between the plates z, and measure z from the 
level of the lower plate, which is fixed in position. Then, if z' is the height of the 
counterweight. 


U = U a + £/ mech 


KSV* 
2-4 Jtz 


+ mgz'. 


( 1 ) 


For equilibrium and for a constant oharge (in this case only is the total energy 
given by (1)), i.e. for 




we have 


Since 8z' = —8 z, we have, for 


KSV* 
8nga *’ 


(4) 


It appears strange that A ’occurs in the numerator, while the force of attraction 
between given charges diminished to 1 /K times its former value when a medium 
of dielectric constant replaces one whose value is unity. In this case, however, 
it is not the charge but the potential difference which is held constant. Cf. also 
Exercises 71 and 79. 


Ex. 82, p. 295. If a quantity of electricity de is removed from the condenser 
in a time it its potential difference changes by 


By Ohm’s Law, 

dV = ~u- 

. . (1) 

II 

. . (2) 

Combining, 

"--ll 

. . (3) 

Integrated, this gives 



log V = 

~ m + log v °’ or v “/ v o e ‘ ,CR - 

• ■ (4) 

Hence 

/ = v ° e -t/CR 

• • (5) 


Ex. 83, p. 295. As in the preceding problem, the process of charging the 
condenser is found to proceed according to the equation 
V = F„(l - e-tlCB). 


( 1 ) 



APPENDIX 


849 


For t = T we must have F = F,; hence the time of charging — which, for in- 
stantaneous discharge, must be equal to the period of the pulsating current — is 
given by w 

T = RC log -j p^ . 

Ex. 84, p. 300. A uniformly magnetized sphere may be replaced by a mag- 
netic doublet at its centre. Its potential is 


H= -gradF= - ^ 


sin <j> 3 m sin <f> 2m sin <j> 


Ex. 85, p. 300. It follows from 


that a current whose density is 3-9 X 10 -ls amp/cm. 2 flows into the surface of 
the earth from above. This current is many times the normal atmospheric cur- 
rent, and no adequate explanation of its presence has yet been given. Moreover, 
the reality of the vortical part of the earth’s field is doubted. 

Ex. 86, p. 305. From considerations of symmetry it follows that the magnetic 
lines of force are circles whose centres lie on the axis of the cylinder and whose 
planes are normal to this axis. It follows from 


that, for a constant current density i inside. 


where r is the distance of the field point 'from the axis of symmetry. 

Ex. 87, p. 310. If the coil has N turns, eaoh of cross-section S, the torque 
aoting on it in its normal position in a field H when a current 1 flows is, by equa- 
tion (29') (p. 309), 

NSHI T ( NSH\ 

- c - = g7, (? = — ) (I) 
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The equation of motion of the coil whose moment of inertia is J and whose 
damping constant is r is 

4 + *•’}> + T<f> = ql (2) 

where t is the moment of torsion of the suspension. Under the assumption that 
<|> does not depart appreciably from zero while the current impulse is applied, 
equation (2) may be integrated, giving 

4 = ?//*. or * = ?? (20 


Thus the angular velocity is proportional to the total quantity of eleotricity 
which flows. We then have this value of the initial velocity in the equation 

4 + 4 + T<j. = 0 (3) 


of the free, damped vibration of the coil. The solution, whioh yields (j> = 0 for 
t = ° is 

» - ArWVit sin ( — t) (4) 

For t = 0 we must have 


4>o 


qe 

J 


A V4vJ — r 3 
A 2 J 


ie j M 

V4t J - r*‘ 


. . ( 6 ) 


deflection will be reached at a time t m determined by 
. V4rJ r — r * , , V 4-vJ — r* ^4xJ — r* f 


V4t J - 
2 J 


(6) 

(6') 


For the usual case of weak damping, r<^ 4t J, and the right member may be 


set approximately equal to tan tt/2, and we have 

7t J 

< 7) 

whence <j>max == e (8) 


It should be noticed that the damping constant r is determined largely by the 
induoed current resulting from the motion of the coil (cf. Ex. 88). The “ gal- 
vanometer oonstant ” C, the number by whioh the observed defleotian must be 
multiplied to give the current or quantity of charge, is usually stated. Neglecting 
the damping, the ballistic constant O b is, by the last equation, while 

the constant for steady current, O a = t /q, b y equation (1). The two constants 
are related approximately by O t — <7 0 vV/v == GJPI'Zt;, where T is the period of 
swing. 

Ex. 88, p. 313. In turning away from the rest position, in whioh the normal 
to the plane of the coil is perpendicular to H, the change in flux in a medium of 
permeability 1 is given by dQ>jdt = N8H$. Hence the induoed voltage is 
V t =» — NSH^/c — — g<j> and the induoed current is I f =» —qfy/R. The torque 
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acting on tbe current-carrying coil amounts to M — ql { = —g 2 ^/R. The damping 
constant r in Exercise 87 thus has the value q 2 jR. 


Ex. 89, p. 316. Of. p. 664 et aeq. 


* Ex. 90, p. 316. 
(a) We have 




- 2 a-flt 008 <|> + z 2 ‘ 


In our case the angle between ds 1 and d.s 2 is equal to the polar angle <j>. 

Maxwell succeeded in referring this integral back to tabulated elliptio integrals. 
The formulae for this conversion may be found in Gratz’s Handbuch der Etek- 
trizitdt und des Magnetismus, Bd. V, p. 65. We confine our considerations to two 
ciroles of almost equal size whose distance apart z is small. Introducing the 
largest and smallest distances 

p 2 — Z 2 + («! — a 2 ) 2 and q s = z* + (aq -f a,) 2 

we have 

3 2 - P* = 4a i a 2, “ P a + (? 2 ~ P 2 ) «n 2 


o that the integral beconeB 

L„ = t . 27to, .2 f" 
c Jo 


. = i 


Since, according to our assumptions, q p, we can neglect p as compared 
with q for larger values of the angle 4>- This is not allowable when <|> is small, 
since (f is multiplied by sin 2 <j>/2. We thus divide the range of the integral into 
two parts — one for small values of (j> (0 to a), the other for large values (a to n): 



This formula was given by Maxwell. 
(6) On account of the expression 


the electromagnetic field energy is a positive definite quantity. If we use the 
quadratic form of equation (13) (p. 316), its discriminant must always be positive, 
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The result may be seen physically by considering the flux which a given coil 
causes to pass through itself and through the other coil. The most favourable 
case, e.g. when both coils are wound upon the same iron core, gives L tl L n = L u *. 

Ex. 91, p. 319. The familiar relation for resistances connected in parallel, viz. 

A = J_ + JL 

R R x ^ R,’ 

holds also for complex resistances, i.e. reactances in the complex plane; hence 
1 1 , . , 


Thus for to = l/VLG (resonance) we have R = oo. 

Ex. 92, p. 319. Since with increasing field the permeability of iron p=|B|/|H| 
decreases, the self-induction of an iron-cored coil will also decrease with increasing 
ourrent. Hence the natural frequency of the oircuit increases as the current 
grows. The details are given by Schunk and Zenneck, Jahrbuch der drahtlosen 
Telegraphie, 19, p. 170 (1922). 


Ex. 93, ] 


332. Prom S = 
S! = |H|)is 


c[EH]/4tc it follows that the mean energy flux ir 
_ 2 X 419 X 10’ 


"* e.s.u. = 10 volts/om. 

* Ex. 94, 342. Both terms of equation (43) (p. 337) must be taken into account 
in the transitional region, so that 

H = (1) 

8 = i tan-t 

Hence the phase angle is 

r) = « (t - r - - i tan- 1 (2) 

The wave velocity is that of an observer r, who moves in such way that <j> = const. 
Thus, by differentiation, we obtain 

0 = 1—; + i-r— , (3) 
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Ex. 95, p. 354. We have 0 = 45°, 0" = 28° 7-5', I, = 0-046 I a , I„ = 0-0042 
whence P — 83 per cent. 

Ex. 96, p. 354. We perform the calculation for normal incidence; the pro- 
cedure is entirely similar for oblique incidence. In a vacuum, | H | = | E j, in 
the dielectric | H [ = n | E |. Also 

so that we have (to a common factor) 

, s |,?= «», i s' i = v i s" i = jjv 

from which we see that S = S'j+ S". It is to be noticed that this balance is not 
obtained by simply setting | Sj proportional to A 2 in each medium, since the 
factor of proportionality contains n. 

Ex. 97, p. 356. tan | = must equal unity. In this formula 

n signifies the index of refraction for the ray emerging from the medium into air, 
i.e. n < 1 ; if we put » = 1/n', n' is the usual index of refraction of the material. 
We seek the smallest value of n' for which tan (8/2) can still be equal to unity. 
This extremal problem for n =/<sin0) then yields the equation 



which is an extremum for n' = 2-41. 

For two reflections 

tan - = tan — = V2 — 1 = ^.g V . 8ina 9 ~ n j 
2 8 sin a 0 

For n — 2/3, this gives the two angles 

0, = 60° 20' and 0 2 = 53° 10'. 

* Ex. 98, p. 375, It follows from considerations of symmetry that the curves 
of equal difference of phase are circles. The position of the planes of vibration 
within the crystal may be found as follows. The wave surface corresponding to 
the ordinary ?ay is a sphere, and the same, is true of the normal surface; hence 
in equation (26) of the text (p. 365) we put v n — Vm for a uniaxial crystal. If we 
take the 2 -axis along the optic axis, we have Ii x — 0 for v — Vj„ i.e. D is normal 
to the axis. But since we always have v — v n for the ordinary ray, the plane of 
D is always normal to the axis. In addition, D is in a plane which is normal to n, 
i.e. it is determined by the line of intersection of a plane perpendicular to n and 
one perpendicular to the axis. Thus, for all incident rays corresponding to a 
given plane of incidence, D is in the same direction, viz. normal to the plane of 
incidence. Now, if the plane of incidence is normal to the direction of trans- 
mission of the first Nicol prism, only the ordinary ray will be present in the crystal; 
if the incident plane is parallel to the direction of transmission, the ordinary ray 
will be absent. Hence the isogyre is a right-angled cross. 

Ex. 99, p. 390. If the index diagram be drawn as in fig. 4, p. 383, which cor- 
responds to this case, the amplitude of the direct image will be A/2, where A 
is the amplitude of the incident light. The amplitude of the first order diffraction 
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pattern will be Aj it; the orders 2, 4, ... 2 n yield zero, since in these instances 
the vector diagram is closed for any grating space. The third order, on the other 
hand, yields A/3n — in general, the odd orders have amplitudes A/(2n + l)it. 
The corresponding intensities are 

r _ # r _ r A* , A* 

0 4’ 1 * 9ir*’ * ' Sn+1 ~ (2n + 

The combined intensity of the direct image and the diffraction patterns on both 
sides is 


I 


A*2A* 
4 + 




(2n + 1)*. 


A* 
2 ‘ 


Since the intensity -of the incident light is given by A 2 , and since half the energy 
is absorbed, the energy balance is maintained. Thus all the diffraction patterns 
together have the same intensity as ther direct image. 

Ex. 100, p. 390. The distanoe b between the slits is large compared with the 
slit width a; .as a result, the diffraction pattern of the single slit is crossed by 
maxima and minima whose separation is given by (a -- a 0 )b = ^X. For normal 
incidence, a = hjX/b; for almost normal incidence, a.' — h^X/b 4- e. The best 
visibility of the fringes is obtained for 

« = «',i.e.^~^ + £ or 


If 6 is changed, two adjoining distances for which the best visibility is obtained 
are separated by Ab — X/e. This relation is of use in measuring the distance 
between two stars which are close together and in measuring stellar diameters. 
In the latter case e is approximately the angular distance between the centres 
of gravity of the two halves of the stellar disc. 

Ex. 101, p. 390. From equation (30) (p. 387) it follows that 


• ? 
sm 2 


V Hl * 2 + H t ** + H 3 * 


We limit our considerations to the reflected rays which are normal to the axis 
of rotation; for these H 3 * = 0. Then for 


Hf H* m 0/2 

1 0 I 14° 20' 

I 0 2 29° 30' 

1 0 3 47° 40' 

1 0 4 80° 00" 


HS H* m 0/2 

11 1 20 ° 20 ' 

1 1 2 44° 10' 

2 1 1 33° 20' 


The ray reflected from the plane = 2, H t * — 0 coincides with the second order 
reflection from the plane US = 1, US = 0. Similarly, the angles resulting by 
interchanging US and HS coincide with those already contained in the above 
list, and so are not set down separately. 


Ex. 102, p. 403. It follows from y' =f tana that the diameter of the image 
is 1-74 cm. 


Ex. 103, p. 403. Let the plane in question out the axis at P and the principal 
plane on the object side in the line G. The point P has a corresponding image 
point P' related to it by the image equation, while to G there corresponds a parallel 
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line G' in the principal plane on the image side. G' and P then determine the 
image plane required. 

Ex. 104, p. 400. Project the incident and refraoted rays on the given plane 
and describe a unit sphere about the intersection. The normal to the inoident 
surface, the rays themselves and their projections on each side of the refracting 
plane define a right spherical triangle. Calling the angle of incidence a, the angle 
of refraction a", the corresponding angles with respect to the given plane 0 and 
0", and the angle between the given plane and the plane of incidence (3, the 
sine law of spherioal trigonometry yields sin 0 = sin a sin p and sin 0" = sin a" sin p, 
whence sm0/sin0" = sin a/sin a" = n. 

* Ex. 106, p. 409. To each point of the slit there corresponds a parallel beam, 
and each of these beams corresponds to a point in the focal planS of the camera. 
Only the rays coming from the centre of the slit pass through the prism in a 
direction perpendicular to the edges; in fact, it is usually only the path of these 
rays which is reproduced in diagrams. The rays coming from the upper and 
lower parts of the slit pass through the prism obliquely, and a computation shows 
that their deflection is greater than that of the central rays. The result is that 
the spectral lines are ourved, the convex side being toward the red. The analysis 
is given in E. Kayseris Eandbuch der Spektroskopie, Bd. II. p. 260, Leipzig (1900). 

Ex. 106, p. 409. Taking aooount of the signs, the focal lengths of the front 
and back lens surfaces are 



If d is the lens thickness, the interval D => — + d + /„ so that 


= /i/> = 


£> ( » - 1)j8* 


where R = w(r„ — fj) + (n — l)d. 

Ex. 107, p. 409. Write the sine condition in the form 


sinu' n y‘ 


This relation is, obviously, also valid in the region near the axis, where the sine 
and the angle are interchangeable. Consider a ray from an axial point, distant 3 
from the lens, and making an angle u with the axis;.. Let h be the distance from the 
axis at whioh this ray meets the first surface. Then sinu = h/s. For a very dis- 
tant point, a may be replaced by x, the distance from the principal plane on the 
object side. Furthermore, since y and y' are small, y'jy may be replaced by the 
relation y'jy = —f/x which holds for points near the axis. We then have 


h 

sinu 


-£/=/' 


One consequence of this relation is that the largest possible value of the ratio 
of the diameter of a beam parallel to the axis to the fooal length will be 2, if the 
sine condition is to hold. It is more usual to specify the relative aperture of an 
objective, whioh is defined as the reciprocal of the above quantity. Thus the 
limiting value of the relative aperture will be “// 0-5 ” 

Ex. 108, p. 410. Let the prism be completely filled with light and in the posi- 
tion of minimum deviation. According to the principles of geometrical optics, the 
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optical paths along those parts of the extreme rays included between planes normal 
to the beam must be equal. Calling the base of the prism b and the path in air 
of the ray through the vertex I, we then have 

nb- 21 = 0. 


Also, the limitation of the width of the beam is responsible for diffraction such 
as would be produced by a slit of the same width. The first minimum of the 
diffraction pattern falls in a direction for wliieh the difference of the optical paths 
of the two marginal rays is X, for then, as in the case of a slit, the two halves of 
the beam annul each other. This occurs, then, when 

nb — 21 = X. (1) 


If the direct image for a neighbouring wave-length falls in this position, the two 
Bpeotral lines can just be separated, and we have 


nh + b~d\-2l = 0. 


(2) 


Subtraction of equation (1) gives 



b 


dn 

5x* 


(3) 


Thus the resolving power of a prism is equal to the produot of the base by the 
dispersion of the substance of which it is made. 


Ex. 109, p. 413. By equation (31) (p. 411) the path difference is 
A/ = _2d tana' sin a = 2d x 'n* — sin 3 a. 


Ex. 110, p. 424. 

(a) The attraction between two electrons at 
. (4-80)* x 10-“ 

!0-i* 


apart of 10~* c: 
2-30 x 10 -3 dynes. 


(6) The attraction between the ion groups is given by 

* 10“ to- 

(c) The volume density of the charge in a sphere of radius r is 
3 F 

^ ~ 4irr*. 


where F is the Faraday equivalent charge. The work needed to add 
thickness dr and density of charge p to the sphere is 


dW = 2*S . = V*?rVrfr. 


shell of 


To build up the entire sphere from widely scattered charges requires 
of work 




amount 
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This is the potential energy of the sphere (of.' Ex. 80). If we consider a virtual 
expansion dR, the energy decreases by 


* B? 


This change in energy is equal to the work done by the total 
virtual expansion: 

pdv = p . 4rc B?dR. 

Equating the two quantities, 

p = 10 *° = 4x 10 ” dyne8/cm -‘ ~ 4 x 1017 atm - 

Ex. lll,p.43i. The electrical deviation after traversing a condenser of length 
a is, by equation (8), 


It follows readily from equation (5) or equation (6) (p. 427) that the slope of 
the tangent to the path is 

dy _ Ea 


~2V 


(2) 


If the screen is a distance l cm. from the end of the condenser, the total deflection 

- Tv + “ = fv (2 + 0 = ten “ (f + 0 (30 


Ex. 112, p. 431. As long as a is small, cos a may be set equal to 1. This amounts 
to taking the velocity parallel to the field to be the same for all rays: Inasmuch 
as the projection of the orbit on a plane normal to the field is a circle, all the 
electrons describe helical orbits. It is readily shown that all have the same period 
of revolution: From equations (11) and (12) (p. 428), it follows that m(cPxldt 2 ) = 
—(e 2 /mc 2 )H 2 x. This is the well-known wave equation, for which the period is 
given by T = 2 vmcjeH. The speed v does not appear, so that after a given time 
all particles have described one revolution and have returned to the axis. The 
subject of geometric electron optics, which concerns itself with image formation 
by electrons as brought about by electrio and magnetio fields, has undergone a 
rapid development in recent years. 

Ex. 113, p. 455. If there are in all Vj molecules of Substance 1 present, the 
number of molecules per oubio centimetre is 



where m 1 is the mass, pj the density of Substance 1. Then we have 

= = <»> 

Similarly, for Substance 2, 


( 2 ) 
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The mixture contains a total of v = Vj + v 2 molecules whose average polariza- 
bility is, say, a, whence 


But on account of the linear superposition of the effects of the dipoles, the average 
polarizability is calculated according to the simple rule 


Substituting this in equation (3), it foBows that B = B 1 -f B v 
Ex. 114, p. 455. The molecular refraction of water is 

E _ V-l ,o_7x 18 

“ Jf+] 2 • 18 " “si- - 37 cm ' 8 ’ 


Further, from p. 454, 


Ex. 115, p. 465. Since v„ = 6, we obtain 
Ne*/m O . 


1 dV^rg 


Thus the phase velocity in the Heaviside Layer is greater than e 
Ex. 110, p. 467. We have 

4rtWp* K - 1 n ! - 1 . „ 


(The value of n extrapolated to X = » is about 4/3.) It then follows that 
p = 0-8 X 10-“ e.s.u. 

This value is too small; the actual value is 1-8 x 10 _ “. The reason for this 
discrepancy is association taking place in the liquid state, in whioh case the 
dipoles are no longer independent. The true value is to be obtained by measure- 
ments on gases or, in other cases, or solutions in dipole-free solvents. 

* Ex. 117, p. 483. For tho medium at rest, « 0 — c/v 0 ; further, from equation 
(41) (p. 244), the phase velocity in the moving medium, for an observer at rest, is 
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where u is the velocity of the medium («<C c ). Then 

« = »o + *(l — 

Ex. 118, p. 494. p is given by equation (6); there results p — 232 atm. 

Ex. 119, p. 500. Prom the definition of x, this quantity for an ideal gai 
given by ' 


1 1 1 6 , a \ 

RTv 2 a™ b\ a p \ v pi?)’ 

(T^ji -p p \ 1+ i)~* 

^efficient of Au, as 
re have 

= 27^1 _ J_y 

4 \ 3cJ 


Ex. 120, p. 510. Setting the coefficient of Au, as given by (34), equal to zero 
and combining with (26), p. 499, we have 


Expressing co in terms of t and substituting in the reduced equation of state, 
we obtain the equation of the transition curve: 

n = -27 + 24V3vi - 12-r. 


Ex. 121, p. 618. By no means! Electrical energy is equivalent to mechanical 
work; and according to the second law one can, by expending work, transfer 
heat from a oooler reservoir (say a body of water) to a warmer one (a boiler), 
cf. p. 517. If Tg = 283° and T t = 293°, then t, = 10/293 == 1/30 and W % QJ29 
or Q 1 = 29 W. Thus the thermal effectiveness may be increased, theoretically, 
by a factor of almost 30. “ Heat pumps ” of this kind are coming into use; the 
actual efficiency is, of oourse, less than the figure given. 


Ex. 122, p. 624. Starting at a 
mixing is 


point, say 0 8 C., the entropy before 


101 °g^ + 20 log- 


(speoific heat of water 


= 1). After mixing, the final temperature is 43-3° C„ 

-j 


In taking the difference, the initial temperature 273 drops out — as it must do — 
and we have 


£,-£, = 30 log 316-3 - 10 log 373 - 20 log 288 = 0-23 oal./deg. 

Ex. 123, p. 524. In one point only. If the two curves could intersect in two 
points, they would enolose a surface which represents work done. One could 
thus construct a perpetuum mobile of the second kind by permitting work to be 
done along the isothermal — the energy being taken from a boiler — and allowing 
the system to return to its initial configuration by moving along the adiabatic. 
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Ex. 124, p. 624. The areas under the two arcs must be equal, for it must be 
impossible to construct a perpetuum mobile of the second kind by traversing this 
part of the curve. 

Ex. 125, p. 528. We have 

TdS = dU + ®de =(§?)/? + (~)/e + 3>de 

- r [(£),*+ (I)." 1 ] <» 


Comparing coefficients. 


©.-*[©.-+•] « 

ffl.-i®.- « 


iGStPj -At©.**] -iS» ■ • «*» 

— '®.~®,- •« 

If the Faraday equivalent charge F has passed, one gramme equivalent has been 
converted and the energy difference is equal to the heat of reaction per mol 
divided by the valence Z, i.e. equal to —q 0 ~/Z. Then 

(8U\ = _ ?1 T 

\de/ T ZF 

-* ®- T (S)-lv <«’ 


which is Helmholtz’s equation for the galvanio cell. Observe that ZF<t> corxe- 
sponds to the free energy. 

Ex. 126, p. 528. It follows from the first law, at constant pressure, 

$ q = du+ pdv =d(u+pv) = dh = (jfi) p dT ‘ 
whence = (S)* 

By analogy with equations (34) to (37) on pp. 527-528 it follows that 
= (jfr) p iT + (^) r dp = f( du + P dv > = f <dh - vd P> 
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Through comparison and “ crosswise differentiation ” there results 

(g 

uation (38) (p. 528), replace 

and with the equation of state l = (G/T)H, we get 

iff 

<=ii - c, = -y = x y-, 

where x = susceptibility per mol and c { is the ordinary specific heat. 

* Ex. 128, p. 584. For T = 0, i.e. for a: -*• oo, * tanh x — log cosh x becomes 
equal to 2, and so s m = — fi! log 2. Statistically, TF ma g = 1, «m»g = 0. 


“■ ©I®! erer"©’ 

so that s unmag = kL log 2 = R log 2. 

Thus the entropy, in agreement with the above, is an amount R log 2 greater in 
the unmagnetized state than in the magnetized state. 

* Ex. 129, p. 608. The energy levels of the rotator are given on p. 681. In 
the old quantum mechanics, 

m*h* 

U « = 8^j W 


Of a total number L of oscillators, the number in the mth quantum state is given 
by the Boltzmann Principle as 

„ Le-m'h'I^HT Le-m'x 

~ = (2) 

where x = h*/Stx?IkT. Remembering that there are two rotational degrees of 
freedom, i.e. two axes normal to the axis of figure, the total energy is 

U r = 2 (3) 


Differentiating with respect to T, we obtain — after some computation — the 
rotational part of the speoifio heat: 


Or 


(4) 


The evaluation of the sums was first carried out by P. Ehrenfest, Verh. d. Deut- 
schen Phys. Qes. 15, p. 455 (1913). It leads to qualitative agreement with the 
results found for U v Nevertheless, this simple theory is considerably in need of 
improvement,. Every advance in the quantum theory is marked by an improve- 
ment in the theory of e,. This was terminated by the discovery that H, is a mix- 
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ture of two modifications — ortho- and para-hydrogen — one of which can possess 
only even, the other only odd quantom numbers. For details consult Eucken’s 
Lehrbuch der chemiechen Phyaik, Edn. 2, II, 1, p. 253 (1943). 

* Ex. 130, p. <524. According to p. 616, radiation incident at an angle 8 exerts 
a pressure amounting to pg = (2S/c) cos*0 and so in an angular interval d0, 
dp = 2tt(2S/c) cos* 0 sin0d0. Integrating from 0 to w yields p = 8toS/3c. But S v 
is identical with K v in equation (15), and so p v — p8/3 and p = p/3. Now, at 
temperature (T + dT ), let a piston be pushed forward by radiation pressure of 
amount p + (dpjdT)dT until the volume increases by v; then follow with adia- 
batic expansion that lowers the temperature from {T + dT) to T. Finally, 
perform a compression at temperature T. The work done is v{dp/dT)dT, while 
heat energy of amount 

(p + %dT)v+(p + %dT)x> 


was supplied in the expansion. Neglecting quantities that are small compared 
with p and p, the second law yields 


dp 

dT 


dT 


(P + 9) AT rfp_4p 
T ’ ° r dT~ T' 


Integration gives the result 


Written in exothermal form, the reaction is 

Ion + Electron = Atom -f- q 0 ~. 

The quantity q 0 ~ is calculated from the ionization potential V f according to 


The chemical constants of the ion and atom are equal; also, the specific heat of 
each of the monatomic gases is 5R/2. If the degree of dissociation (ionization) is 
a, the partial pressure of the ions is 


that of the electrons is 

and that of the atoms 


since, in all, (1 + a)N particles result from N neutral atoms. Then, by equation 
(44) (p. 541). 

108 --*iTT + i '° sT - 9 ' 48 - 
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For Ca (ionization potential 6 volts) at 6000° and 1 atm. we obtain a = 8 per 
cent; at 6000° and 1(J^ 2 atm., a — 64 per cent. Thus, observation of the relative 
intensity of the lines of Ca and Ca+ in stellar spectra famishes us with the possi- 
bility of determining the conditions of temperature and pressure. This was 
first recognized by M. N. Saha, to whom the theory in its simple form is due. 
Consult Handbuch der Astrophysik, Vol. I, Berlin, J. Springer (1929). 

* Ex. 132, p. 674. 

LS coupling: + « a yields (a) S = 0 (singlet system), (6) 8 = 1 (triplet 
system); l x -j- yields L — X. (a) gives rise to the term 
l P v and (6) to the three neighbouring terms *P 0 , *P V *P 3 . 

jj coupling: s x -f l x yields (a) j x = $, (6)j, = f ; s 3 + k yi elds = i- 
(a) gives rise to two neighbouring terms having J — 0 and 
J = 1, and (6) to two terms dose together having J = 1 
and J — 2. 

Thus LS coupling produces a grouping of 1-and 3 terms, jj coupling produces 
two pairs of terms. 

* Ex. 133, p. 674. In the longitudinal Zeeman effect the absorption frequenoy 
for right-handed circularly polarized light is somewhat different from that for 
left-handed circularly polarized light. The component which rotates in the same 
sense as the current which would cause the magnetic field and which, by p. 663 , 
is displaced toward higher frequencies is designated the right-handed component. 
In the normal effect, 

Av = i^mc H ’ i-e ’ V °’ r = V ° + Av ' V °’ I = v ° “ Av - • • (!) 


Now this displacement of the characteristic frequencies causes a change in the 
refractive index (p. 461); and a difference in refractive index — i.e. in phase 
velocity — for the right- and left-handed rays means, according to p. 65, a rotation 
of the plane of polarization in the direction of turning of tho faster-moving wave, 
i.e. the wave with the smaller index of refraction. The angle of rotation is equal 
to half the phase difference, so that for a layer of thickness D we have 


$ 

2 


2 nvD 


(a, - » r ). 


(2) 


According to the disperse 


Nf^/m 

2rt[(v 0 - Av)* - v*]’ 


At higher temperatures — a condition always satisfied when investigating gases — 
both components are of the same intensity, so that ft = f r = //2, where / is the 
strength of the oscillators for the uninfluenced line. At lower temperatures, at 
which the line spectra of crystals are observed, one of the Zeeman components 
may, on account of the Boltzmann distribution of orientations, be considerably 
weaker than the other, so that f T and f t will be different, and this can cause the 
sign of the effect to reverse, giving the so-called “ paramagnetic Faraday Effect ”, 
Considering only the normal case where /, =/ r = // 2, we have 

Nfe* f 1 1 ) Nfe> 4Av . v„ 

1 r 4vm t^-Avp-y* [vo + Av]*-v*/“4mn‘(v 0 *-v*)*’ 


(4) 
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Thus the rotation is proportional to H. Similarly, we find that in transverse 
observation, on account of the equality of the intensities of the parallel and 
perpendicular components, there is no linear effect but rather a small quadratic 
one. In the same way, the Kerr effect is found to be given as a consequence of the 
Stark effect. 


Ex. 134, p. 743. According to equations (6) and (6) of the text we have, for 
a given velocity, dEfdx proportional to 2* but independent of M, and dE pro- 
portional to M, so that the range will be proportional to Mz~*. Thus R a — fc(4/4) 
and R B = fc( 1 -0081/1 ), from which R a = 1-008 S a ; i.e. for the same velocity the 
ranges are nearly equal. But we wish to know, from the ranges, the energies of 
the particles. Because the energy, for a given velocity, is proportional to the mass, 
we have that 


Eg 1-008 


Eg = 3-02 Mev. 


This is the proton energy corresponding to a range of 14-53 cm. For heavy partioles 
of these energies, relativistic effects may be neglected. 

Ex. 135, p. 763. (o) p~ decay (neutron-*- proton + $~, Z Z + 1): -Nucleus 
Z stable if M, +I + me i > M, or, after addition of Zm^ u A, +1 > A r 

(b) (3 + decay (proton ->■ neutron + |£ + , Z-* Z — 1): Stability of nuoleus Z 

if if,_i + mei > M, or, after adding Zm*\, + 2mei > A,. 

(c) ^-capture (proton + P - -*- neutron, Z-* Z — 1): Stability of nucleus Z 

if M z _ , > M z + »*e: or, after addition of (Z — l)mei, A,- t > A t . 
Stability with respect to (a) and (c) — hence also for (6) — is assured if A, < 
and A z < A t+V 

Ex. 136, p. 819. Putting p = 4 in (31), p. 813, and using (43), p. 818, 

J t/t = V 2/mc (sin x/x — cos x). 



Table I 

CONVERSION OF UNITS 

All equations in the text are in the absolute C.6.S. system of units and some 
are given also in M.K.S. units. If it is desired to introduce into the C.6.S. equations 
a quantity which is given in other units, it must first be multiplied by the numeri- 
cal factors given in the table; if the result of the computation is to be expressed 
in other units, it must be divided by these factors. All electrical quantities are 
in the absolute electrostatic system, all magnetic quantities are in the electro- 
magnetic system of units. 


Mechanical Quantities 

1 kg. wt. = 981,000 dynes = 9-81 newtons. 

1 atm. (old unit) = 760 mm. Hg = 1-013 X 10* dynes cm. -1 

= 1-013 x 10 5 newtons m. -2 
= 1013 millibars, 

1 mm. Hg = 1333 dynes cm. -2 , 

1 atm. (new unit) = 1 kg. cm. -2 = 981,000 dynes om. -2 . 

1 joule = 10’ ergs, 

1 cal. = 4- 19 X 10 7 ergs = 4-19 newton m., 

1 large oalorie (kg. cal.) = 4-19 x 10*° ergs = 4190 newton m. 

1 eleotron volt = 1-60 X 10 -12 erg = 9-66 x 10 11 erg mol -1 
= 23,030 cal. mol. -1 , 

1 kilowatt hour = 3-6 X 10 12 ergs. 

1 watt = 1 joule see. -1 = 10 7 erg sec. -1 , 

1 horse-power = 746 watts. 

Modulus of elasticity: 1 kg. mm. -2 = 9-81 X 10 7 dyne cm. -1 . 

Compressibility: 1 atm. -1 (new unit) = 1/981,000 = 1-02 X 10 - * cm. 2 dyne -1 . 


Force: 

Pressure: 


Energy: 




Electrical Quantities ( with dimensions in e.s.u.) 


Charge: 

Current: 

Potential: 

Resistance: 

Capacitance: 

Inductance: 


coulomb 

e.m.u. 

ampere 

volt 




e.m.u. 

microfarad (mf.) 
henry 


= 1 X 10“ cm." 2 gm. 1 / 2 sec. -1 , 
= 3 X 10“ e.s.u. 

= 3 X 10® om." 1 gm. l/2 seo. -2 , 
= 3 X 10“ e.s.u. 

= 1/300 om. 1 / 2 gm. 1/2 sec. -1 , 

= 1/(3 X 10“) e.s.u. 

= 1/(9 X 10 11 ) cm. -1 sec., 

= 1/(9 x 10 20 ) e.s.u. 

= 9 X 10 11 cm., 

= 9 X 10*° e.s.u., 

= 9 X 10* om. 

= 1/(9 X 10 u ) cm. -1 sec. 2 , 

= 1/(9 X 10 20 ) e.s.u. 


(8 711 ) 
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Important relations: 

If the magnetic flux through, a circuit changes at the rate of l .maxwell/soc., 
an electromotive force of 10~ 8 volt is induced. 

Hie energy sent to an area of 1 cm. 2 normal to the direction of radiation, and 
at a distance of 1 m. from a Hefner lamp is 900 ergs/sec. Of this, about 1 per 
cent falls in the visible region. (1 He&er unit = 0-9 international candle.) 

Excitation (or ionization) potential is connected with the wave-length of the 
corresponding radiation (cf. p. 432) by 

X X V = 12380. 

(A.U.) (volts) 


De Broglie wave-length and volt velocity of electrons are related by (of. 
equation (24), p. 695) 


X = 
(A.U.) 


im 

Vt- 

(volts) 


Relation of energy in electron volts and in ergs: 

lev= 1-601 X 10-“ erg. 

Mass-energy relation, atomic mass units, and mega-electron volts: 

- 1 a.m.u. = 932 Mev. 


Table II 

PHYSICAL CONSTANTS* 

Charge on the electron 

e = (4-80286 ± 0-00009) X 10~ 12 e.s.u. 

Rest mass of the electron 

m = (9-1083 ± 0-0003) X 10-“ gm. 

Ratio, proton mass to electron mass 
U+/mN = 1836-12 ± 0-02. 

Planck’s constant 

h = (6-62617 ± 0-00023) X lO-* 7 erg sec. 

Boltzmann constant 

h = R/L = (1-38044 ± 0-00007) X 10~“ erg. deg.- 1 . 

Velocity of light 

c = (2-997930 ± 0-000003) x 10“ cm. sec.- 1 . 

Constant of gravitation 

y = (6-664 ± 0-002) X 10~ 8 dyne cm. 2 gm.-*. 

Specific charge of the electron 

e/m = (5-27305 ± 0-00007) X 10“ e.s.u. gm.- 1 . 

Bohr magneton 

lij, = (9-2837 ± 0-0002) X 10 -21 erg oersted- 1 . 

Faraday 

eL = 96,621-9 ± 1-1 coul. equiv. -1 (chemical scale). 

* Values taken largely from Cohen, Du Mond, Layton and Rollett, Rev. Mod. Phys. 
27, pp. 363-380, Oct. 1966. 
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Avogadro’s number (Loschmidt’s number) 

L = (6-02486 ± 0-00016) x 10“ (physical). 

Number of molecules in 1 cm.* of gas at 0° C. and 1 atm. 

N = 2-6872 ± 0-0001 X 10 1 * cm."*. 

Volume of 1 mol at 0° C. and 1 atm. 

t>„ = (22-4207 ± 0-0006) X 10 s cm.* mol” 1 . 

Universal gas constant 

IcL = It = (8-31696 ± 0-00034) x 10’ erg. mol- 1 deg." 1 

= 1-98 cal. deg.- 1 . 

Constant of Stefan-Boltzmann law of radiation 

c = (6-6687 ± 0-0010) X 10“* erg cm.-* sec.- 1 deg."*. 

Constant of Wien’s Displacement Law 
b = 0-289782 ± 0-000013 cm. deg. 

Rydberg’s number (infinite mass) 

R a => 109737-309 ± 0-012 cm.-*. 

Radius of the first hydrogen electron orbit (first Bohr radius) 
ox = (6-29172 ± 0-00002) x 10“* cm. 
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Abaf io point, 846, Ex. 74. 

Abbe’s sine law, 403. 

Aberration, 246. 

Absolute electrometer, 292. 

— space, 228. 

— velocity, 230. 

— zero, unattainable, 669. 
Absorbing media, 367. 

Absorption frequency, 463. 
Absorptivegsower, 620. 
Accelerated reference frame, 231. 
Acceleration, 82. 

Action, 123. 

— at a distance, 286. 

— variable, 129. 

Adiabatio change, 606. 

— law, 212. 

— lines, 607. 

— principle, 662. 

Aeromechanics, 191. 

Affinor, 33 n. 

Alpha-particle, 646. 

— scattering of, 641. 

Alpha-ray, 748. 

Alternating current, 316. 

transient phenomena, 319. 

Alvarez, 772. 

Ampere’s theorem, 306. 
Amplitude, 60. 

Anfahrwirbel, 211 n. 

Angular frequency, 60. 

— momentum, 89. 

total, 110. 

Anisotropio substances, 278. 
Ankylosis, 698. 

Antecedent, 36. 

“ Anti-Stokes ” line, 721. 
Apparent force, 231. 

Arc spectra, 674 n. 

Argand diagram, 69. 

Associated harmonics, 700, 821. 
Astigmatic difference, 409. 
Aston’s isotope rule, 762. 

Atled operator 30 n. 


Atom, model of, 647. 

— structure of, 641. 

Atomic mass unit, 760. 

— number, 431, 646. 

— observations, inexactness of, 690. 
Avogadro’s law, 495. 

— number, 420, 672, 698. 

Axes, optical, 369. 

Axis of figure, 163. 

Babinet’s theorem, 380. 

Balmer series of spectral lines, 649. 

Band group, 686. 

— spectra, 680. 

— system, 686. 

Band, 731. 

Bars, longitudinal waves in, 180. 

Beats, 67, 323. 

Bernoulli’s equation, 197, 203. 

Berthelot’s principle, 613. 

Bessel functions, 810. 

asymptotic forms of, 817. 

of second kind, 816. 

of third kind, 817. 

orthogonality of, 816. 

Beta-ray, 745. 

Binormal, 19. 

Biogenetios, fundamental law of, 675. 
Biot-Savart law, 110, 206, 207, 300, 303, 
337, 338. 

Black body, 613. 620, 624. 

Bloch and Alvarez, 772. 

Bohr magneton, 665, 744. 

— model of atom, 647. 

— theory, 707. 

Boiling-point, elevation of, 646. 
Boltzmann constant, 462, 693. 

Bond, non-polar, 712. 

Born, M., 706. 

Bose-Einstein statistics, 624. 

Boundary layer, 210. 

detachment of, 210, 211 n. 

waves in, 345. 

Boyle and Mariotte, law of, 494, 565. 
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Boyle’s law, 212. 

Brackett series of spectral lines, 650. 
Bragg law, 388. 

— method, 389. 

Breakdown, electrical, 436 
Bremsstrahlung, 743, 779. 

Brewster’s law. 353. 

Brownian movement, rotational, 602 n. 
Bulk modulus, 173. 

Bunsen’s effusometer, $42, Ex. 61. 

Calculus of variations, 75. 

Calorie, gramme, 487. 

— largo, 487. 

— small, 487. 

Canal rays, 440. 

Doppler effect in, 441. 

Canonical transformation, generator of, 
127. 

— transformations, 124. 

Cantilever beam, bending of, 176. 
Capacity, 274. 

Carnot cycle, 514. 

Cathode drop, 437. 

— rays, 426, 427. 

1- origin of, 440. 

Cations, 4)9, 441. 

Cauchy integral theorem, 72. 
Canehy-Riemaim conditions, 71. 

equations, 201. 

Caustic, 406. 

Cavity, 613. 

Central axis, 144. 

Centre of gravity, 109. 

— of inertia, lp9. 

— of mass, motion of, 108. 

Characteristic curve of thermionic tube, 

799. 

— functions, 186. 

— radiation, infra-red, 607. 

— values, 185. 

of wave equation, 698. 

Charge density, 703. 

— surface density of, 266. 

Chemioal affinity, 563. 

— bond, nature of, 712. 

— constant, 541 636. 

of gas, 638, 557. 

— equilibrium, 543. 

— valence, 716. 

Circuits, inductively coupled, 322. 
Circularly polarized waves, 64. 
Circulation, 202, 203, 210, 211. 

Classical quantum statistics, 603. 


Classical statistics, 579, 603. 
Clausius-Clapeyron equation, 533. 

Cloud chamber, 740. 

Wilson’s, 642. 

Coefficient of emission, 619. 

Collinear system, 403. 

Combination lines, 660. 

Compensator, 357. 

Complex potential, 201, 209, 210. 

— velocity, 203, 207, 208, 210. 

“ Complexion ”, 682. 

Compressibility, isothermal, 493. 
Compton effect, 687, 690, 691, 743. 
Conditionally periodic systems, 132. 
Conducting media, propagation in. 333. 
Conduction, electrolytic, 417. 

— metallic, 444. 

— of electricity in gases, 426. 

— of heat, 670, 671. 

in metals, 447. 

Conductivity, 294. 

— electrical, 631. 

— equivalent, 421. 

— surface, 489. 

— thermal, 488. 

— thermometric, 489. 

Conformal mapping, 69, 70. 

Conjugate complex velocity, 201. 

— tensors, 35. 

Consequent, 36. 

Conservation of energy, principle of, 88. 
Conservative forces*i67. 

Constrained particle, 103. 

Contravariant components, 12. 
Convection coefficient, Fresnel’s, 477. 
Convergence ratio, 401. 

Co-ordinates, curvilinear orthogonal, 40. 
Coriolis acceleration, 233. 
Correspondence principle, 655. 
Corresponding states, theorem of, 500. 
Cosmic-ray showers, 781. 

Cosmic rays, 780. 

Coulomb, 267. 

Coulomb’s law, 266. 

Couple, 144. 

Coupled circuits, 322. 

Coupling, Russell-Saunders, 689. 
Covariant components, 12. 

Crystal, biaxial, 373. 

— cohesive forces in, 574. 

— compressibility of, 577. 

— lattice, 360. 

— uniaxial, 370. 

Crystals, properties of, 573. 
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Cubical dilation, 166. 

Curie and Joliot, 766, 756. 

— constant, 468. 

— law, 629. 

— point, 460, 461. 

Curl of a vector, 26. 

Current density, 293, 726. 

— strength, 293. 

Curves of equal thickness, 411. 

Cut, 73 n. 

Cyclic variables, 126. 

Cylinder functions. See Bexael functions. 

d’Alembert’s principle, 113, 116. 

Dalton’s law, 496. 

Damping, 320, 348. 

Debye-Hiiokel theory of electrolytic con- 
ductivity, 421, 423. 

Decomposition of lithium, 760. 
Deformable solids, mechanics of, 161. 
Degenerate system, 136. 

Degree of degeneracy, 136. 

Del operator, 30 n. 

Delaunay elements, 136. 

Density of charge, 266. 

— of ideal gas, 680. 

Deuteron, 653, 743, 751. 

Dia-electric susceptibility, 280. 
Diamagnetic susceptibility, 463. 
Diamagnetism, 461. 

Diatomic molecule, 586, 605. 

— molecules, rotation spectra of, 702. 
Dielectric constant, 267 n., 277. 

electron theory of, 450. 

— constants, principal, of crystal, 362. 

— displacement, 277. 

— media, 277. 

— polarization, 461, 473. 

Dieleotrics, polarization of, 279. 
Diffraction at circular aperture, 390. 

— at slit, 382, 390. 

— optical, 696. 

— reciprocal theorems of, 379. 

— theory of, 376. 

Diffuse series, 660. 

Diffusivity, 489. 

Dilatation, Einstein time, 243. 

Diopter, 403. 

Dipole, 272. 

Discord, 99. 

Dispersion, 66, 451, 716. 

— anomalous, 453. 

— region of, 413. 

Displacement current, 327, 328, 476. 


Divergence of a vector, 23. 

— theorem, 24. 

Domains, magnetic 462. 

Doppler effect, 235. 246, 691. 

Double layer potential of, 273. 

Doublet, 273. 

— magnetic, 298. 

Droplets, 263. 

Dulong-Petit law of atomic heats, 598. 
Dyad, 36. 

Dyadic products, 36. 

Dynamics of rigid bodies, 143. 

Earth’s horizontal magnetic field, in- 
tensity of, 297. 

— pole, wandering of, 165. 

Eccentric anomaly, 137. 

Efficiency of heat engine, 517. 
Effuscmeter, Bunsen’s, 842. 
Eigenfunktion, 186 n. 

Eigonwert, 186 n. 

Einstein-de Hass effect, 111. 

anomaly in, 669. 

— time dilatation, 243. 

Einstein’s velocity addition theorem, 244, 
245. 

Elastic potential, 175, 176. 

Elasticity, 161. 

Elastodynamics, 161. 

Elastomers, 803. 

Elastostatics, 161. 

Electret, 473. 

Electrical-charge as source of flux, 264. 

— oscillator, 338. 

Electrocaloric phenomena, 528. 
Electrodynamics of moving bodies, 471. 
Electrolytic theory of Arrhenius, 422. 
Electromagnetic equations, invariant form 

of, 480. 

— waves, 360. 

in conducting media, 333. 

in two media, 343. 

propagation of, 328. 

Electrometer, absolute, 292. 

Electron, 426, 429. 

— mjss, variability of, 476. 

— optics, 443, 787. 

— revolving, 463. 

— spinning, 669. 

Electrons, secondary, 433. 

— thermal emission of, 433. 
Electrpphoretic force, 423. 

Electrostatic potential, 267. 

Elementary cell, 574. 
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Ellipsoid, Fresnel, 363. 

— index, 363, 366. 

— of stress, 169. 

Ellipsoid, tensor, 363. 

Elliptic motion, left-handed, 69. 

right-handed, 69. 

Elliptioally polarized waves, 64. 
Emisaivity, 489, 619. 

Enolosnre, 613. 

Endothermal reaction, 640. 

Energy, 86. 

— bands, 726. 

— density, 176, 287, 616. 

— distribution, 688. 

— function of a gas, 603. 

— inertia of, 262. 

— internal, 627. 

Enthalpy, 609. 

— free, 626. 

Entrance pupil, 786. 

Entropy, 622, 624, 640, 666, 679. 

— constant, 636. 

Equation of state, 493, 627. 

of ideal gas, 494. 

of real gas, 497. 

Equations of motion, canonical form, 122. 
Equilibrium, chemical, 639. 

— of disintegration products, 746. 

— of sedimentation, 689. 

— of thermodynamic systems, 631. 

— thermodynamic, 619. 

Equipotentials, 201, 202. 

Ether, 369. 

Euler formula, 60. 

Euler-Lagrange differential equation, 76, 
77. 

Euler-Masoheroni constant, 809. 

Eutectio points, 634. 

Exchange energies, 712. 

— forces, 712. 

Excitation potential, 649. 

Exit pupil, 786. 

Expansion, iBobario volume coefficient of, 
493. 

External forces, 108. 

“ Falling characteristic ” of arc, 440. 
Faraday effect, 674, Ex. 133. 

paramagnetic, 863, Ex. 133. 

— equivalent charge, 664. 

Fermat’s principle, 395, 407. 

Fermi-Dirac statistics, 626, 631. 

Fermi gas, 769. 

Ferromagnetic materials, 306, 306. 


Ferromagnetic media, 343. 

— metals, 299, 460. 

Field equations, Hetz’s solution of, 335, 

— strength, electrioal, 283. 

Fields, quasi-stationary, 311. 

— rapidly alternating, 327. 

— slowly varying, 311. 

“ Fine structure ”, 669. 

First law of thermodynamics, 495. 
Fission of heavier nuclei, 767. 

Fizeau effect, 246. 

Fizeau’s experiment, 477. 

Fleming’s left-hand rule, 309. 

Flow, irrotational, 196, 197. 

— laminar, 214. 

— rotational, 196. 

Flux, 264. 

Focal length, 399. 

of electron lens, 763. 

— plane, 396. 

Focus, principal, 420. 

Force, 84. 

— moment of, 110. 

Forced vibrations, 96. 

Fortrat, 684. 

Foucault pendulum, 236, Ex. 66. 
Four-dimensional world, 247. 

Four-force, 266. 

Four- velocity, 266. 

Fourier integral, 64, 66. 

— series, 54,' 101. 

multiple, 132. 

Fraunhofer diffraction phenomena, 382. 
Free charge, 277. 

— energy, 626. 

— oscillations, 320. 

— path, 438. 

Freezing-point, lowering of, 646. 
Frequency, 60. 

Fresnel diffraction phenomena, 382. 
Fresnel’s formulae, 360, 362. 

Frictional resistance, 94. 

Funicular polygon, 145. 

Galilean transformation, 229, 230. 
Galvanometer, moving-coil, 309. 

Gamma functions, 808. 

— -ray, 746. 

Gas constant, 593. 

Gauss plane, 69- 
Gauss’s theorem, 23, 270. 

Gay-Lussac’s experiment, 604. 

— law, 494. 

Geiger-Mfiller counter, 741. 
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Gelger-Nuttall formula, 747. 
Generalized co-ordinates, 118. 

— momentum co-ordinates, 121. 
Geodesic, 75. 

Geometric velooity, 246. 

Giorgi system of units, 266. 
Glancing angle, 389. 

Goudsmit and Uhlenbeck, 669. 
Gradient, 21. 

Gravitational mass, 259. 

Green's theorem, 270. 

Grid, 434. 

Ground wave, 346. 

Group velooity, 65, 246, 694. 
Gyrocompass, 159. 

Gyroscope, 152. 


Hahn and Strassuaaun, 754. 

Half-value period, 746. 

Hall effect, 734. 

Halley’s formula, 192. 

— law, 589. 

Hamilton-Jaoobi differential equation, 
126, 127, 130. 

Hamilton’s characteristic function, 128. 

— equations, 121. 

— Principle, 123, 124, 693. 

Hankel functions, 817. 

Harmonio analyser, 56 n. 

— vibrations, 93. 

Heat conduction, differential equation of 
488. 

— engine efficiency, 517. 

— function, 509. 

— of fusion, 539. 

— of sublimation, 539. 

— of vaporization, 536. 

— specific, 487. 

— theory of, 484. 

Heaviside layer, 456, 868, Ex. 115. 
Heisenberg, 705. 

Heitler and London, 712. 

Helmhotz - Lagrange equation, 405. 
Helmholtz’s vortex theorems, 205. 
Herpolhode, 141, 142. 

Hersohel effect, 738. 

Hertzian oscillator, 335. 

— vibrator, 338. 

Hess’s law, 512. 

Hittorf s relation, 420. 

Hodograph, 81. 

Holonomous-rheonomous system, 121. 
Holonomous systems, 114. 


Hooke’s law, 171. 

Huygens’ formula, 376. 

— principle, 368- 
Hydrodynamicsl equations, 196. 
Hydromechanics, 191. 
Hydrostatics, 191. 

Hyperbolio orbits, 702. 
Hyperfine structure, 743. 
Hysteresis loop, 461. 


Image distance, 399. 

— space, 397. 

Images, electrical, 275. 

Impedance, 318. 

Impulse, 86. 

Incidence, principle angle of, 357. 
Index (of complex number), 50. 
Indicator diagram, 516. 

Induction, 264, 473. 

— law, 311. 

— magnetic, 298. 

Inertia, 83. 

— of energy, principle of, 268. 

— 'product of, 149. 

Inertial force, 231. 

— frames, 229. 

— mass, 84, 254, 259. 

— resistance, 167. 

Insulating media, 349. 

Intensity of magnetization, 298. 

— of wave, 724. 

Interference fringes, 395, 410. 

— instruments, 410. 

Interferometer, Fabry-P6rot, 411, 412. 
Internal forces, 108. 

Ion, 418. 

“ Ion cloud ”, 650. 

Ionization bursts, 765. 

— photo-, 432. 

— potential, 432, 649. 

— thermal, 432. 

Irrotational flow, 196, 198. 

— region, 197. 

Isobares, 759. 

Isogyre, 376, 853, Ex. 98. 

Isometrio pressure coefficient, 493, 567, 
Isothermal compressibility, 493. 

— lines, 498. 

— volume elasticity, 493. 

Isotope of hydrogen, 743! 

Isotopes, 746. 

Isotopic mass, 766 n. 

Isotropic substance, 171, 278. 
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Jacobian function, 107. 

Joule heat loss, 296. 

Rater’s pendulum theory of, 161. 

Kelvin temperature scale, 618. 

Kepler ellipse, 666, 669. 

Kepler’s first law of planetary motion, 91. 

— second law, 89. 

— third law, 92. 

Kerr effect, 864, Ex. 133. 

Kinematics, 81. 

Kinetic energy, 86, 112, 113. 

— theory of matter, 663. 

Kirchhoff’s formula, 376, 378. 

— laws, 294, 620. 

Kutta-Joukowski lift formula, 208, 210. 

Lagrange’s equations of the first kind, 
113, 117. 

— of the second kind, 118, 120. 
Lambert’s law, 617. 

Laminar flow, 214. 

Langevin function, 463. 

Laplace’s equation, 71, 269. 

Larmor frequency, 772. 

— precession, 664. 

Lattice constant, 674. 

— defects in dielectric crystals, 736. 

— energy, 676. 

Laue-London theory, 466. 

Law of areas, 89, 

— of equipartition of energy, 595. 

— of gravitation, 89, 90. 

— of mass action, 539. 

Le Chatelier’s law, 642 n. 

Left-handed elliptic motion, 59. 

Legendre functions of second kind, 824. 

— polynomials, 699, 820. 

orthogonality of, 822. 

Level surfaces, 21. 

Light quantum, 432. 

— principal velocities of, in crystal, 
362. 

Line integral, 22, 72. 

— of nodes, 136, 137. 

— spectra, 680. 

Linearly polarized waves, 64. 

Lines of force, 264. 

— of motion, 194. 

Liouville’s theorem, 686. 

Liquid films, 227. 

Lissajous figure, 58, 60, 132. 

Local time, 265. 

Logarithmic decrement, 96, 100. 


Longitudinal mass, 476. 

— waves, 64. 

in bars, 180. 

Lorentz contraction, 243, 244. 

— transformation, 240, 249, 260. 

consequences of, 242. 

— triplet, 667. 

Loschmidt's number. See Avogadro's 
number. 

^-mesons, 779. 

Macro-state, 627. 

Magnetic domains, 462. 

— lens, 431, Ex. 112. 

— permeability, electron theory of, 450. 

— reluctance, 308. 

Magnetization by rotation, 464. 
Magnetized shell, 303. 

Magnetocalorio phenomena, 528. 
Magnetomeehanical parallelism, 464. 
Magnetomotive force, 308. 

Magneton, Bohr, 469, 665, 744. 

—7 Weiss, 459. 

Magnetostatic field, 297. 

Magnification, angular, 401. 

— in depth, 400. 

— transverse, 399. 

Magnus effect, 211. 

Malus, law of, 397. 

Manometer method, 290. 

Mass action, law of, 422, 539, 542. 

— correction, 765 n. 

— defect, 765 n. 

— number, 765 n. 

— of photon, 688. 

— spectrograph, 441, 442. 

— variability of, 262. 
matrix calculus, 46. 

Mattauch’s laws, 761. 

Matter waves, 693. 

Maxwell-Boltzmann law, 588. 

Maxwell’s equations, 311, 312, 467. 

— relation, 331. 

Mayer’s equation, 505. 

Mean anomaly, 137. 

— free path, 567. 

— relative velocity, 666. 

Media, anisotropic, propagation in, 360. 
Meissner and Ochsenfeld, 466. 

Meitner, 778. 

Melting-point curve, 534. 

Membranes, 183. 

Mesons, 778. 

Metals, optics of, 357. 



INDEX 


881 


Metals, properties of, 726. 

— reflecting power of, 369. 
Miehelson-Gale experiment, 478. 

— -Morley experiment, 237, 477. 

— stellar interferometer, 376, 863, Ex. 98. 
Micro-state, 627. 

Millikan oil-drop method, 426, 426. 
Mobility of an ion, 419. 

Madu’ated vibrations, 67. 

Modulus of compression, 173. 

— of elasticity, adiabatic, 608. 

isothermal, 608. 

Molecular diameter, 672. 

Moment of deviation, 166. 

— of inertia, 146-9. 

principal, 160. 

— of torsion, 147, 297. 

Momentum, 84. 

Monogenic function, 69. 

Moseley’s law, 666. 

Moving bodies, electrodynamics of. 471. 
Moving coil.induction in, 472. 

Multiple products, 16. 

Multiplets, theory of, 669. 

Multiply periodic systems, 128. 

Muscular action model, 805. 

Mutual inductance, 313. 

— induction, 313. 

coefficient of, 313. 

Nabla operator, 30. 

Nernst heat theorem, 663. 

Neumann function, 830, Ex. 26. 

NoutriDO, 778. 

Neutron, 752. 757. 

— 'mass of, 753. 

Newton (unit), 266., 

Newton’s laws of motion, 84. 

— third law, 108. 

Nicol prism, 374, 375. 

Non-conservative forces, 88. 
Non-divergent field, 302. 

Non-irrptational motion, 198 n. 

Normal acceleration, 82. 

— surface, 368. 

Nuclear attraction, 764, 766. 

— electrons, objections to, 767. 

— isomerism, 763. 

— magneton, 744. 

— moment, 743, 744, 772. 

— physics, 739. 

Nucleus, 431, 645, 646. 

— artificial transformation of, 747. 

— excitation of, 747. 


Nucleus, liquid-drop model of, 763. 

— motion of, 661. 

Nutation, 164. 

— cone of, 158. 

— of force-free top, 158. 

Objeot distance, 399. 

— 'spaoe, 397. 

Oersted (unit), 298. 

Ohm’s law, 293. 

for eleotrolytes, 417. 

for metals, 446, 447. 

Onnes, H. K., 406. 

Onsager theory of electrolytic conduc- 
tivity, 421, 423. 

Operator V , 30. • 

Optical. path, 395. 

— systems, resolving power of, 409. 
Optics, electron, 443. 

— geometrical, 395. 

— interference, 396. 

Orbits, electronic, perturbation of, 662. 

— hyperbolic, 702. 

Orthogonality, 189. 

Ortho-hydrogen, 862, Ex. 129. 
Orthotomic system, 264. . 

Oscillations of plasma, 801. 

Oscillator, stabilization of, 794. 
Osculating plane, 19. 

Osmotic pressure, 547, 594. 


v-mesons, 779. 

Packing fraction, ,765. 

Parabolic velocity profile, 215. 
Para-electric susceptibility, 280, 450. 
Para-hydrogen, 862, Ex. 129. 
Paramagnetic, 460. 

Pure'ectric susceptibility, 460. 

Partial pressures, Dalton’s law of, 496. 
Paschen-Back effect, 667, 743. 

Paschen series of spectral lines, 660. 
Paschen’s law, 436. 

Pauli and Sommerfield, electron theory of, 
449. 

— principle, 632, 679, 715. 

Pedal-surface, 370. 

Perihelion, 136, 137, 260. 

Periodic system, structure of, 674. 

— systems, 128. 

Permeability, 298. 

— magnetic, electron theory of, 460. 
Permittivity, 267. 

Perpetuum mobile, of first kind, 602. 



INDEX 


Perpetuum mobile, of second kind, 617, 
619, 621, 869, Kx. 123. 

Perturbation, 731. 

— of electron orbits, 662. 

Perturbations, theory of, 707. 

Phase, 493. 

— change of, in reflection, 366. 

— difference, 61. 

— integral, 130. 

— orbit, 604. 

— space, 684. 

— velooity, 694. 

Phosphorescence, 736. 

Photo-electrons, 433. 

Photon, 688. 

Pickering series of spectral lines, 663. 
Piezoelectric constants, 792. 

— moduli, 793. 

Piezoelectricity, 792. 

Planck’s law of radiation, 621, 622. 

— quantum of action, 432. 

Plane ciroulatory motion, 207. 

— mathematical pendulum, 104. 

— wave, 62. 

Planetary motion, 90. 

Plasma, oscillations of, 801. 

Point charge, 276. 

PoiaeuiUe’s law, 214, 216. 

Poisson’s equations, 268. 

— ratio, 171. 

Polarization, 280, 284, 360, 628. 

— dielectric, 461, 473. 

— electrical, 460. 

— elliptical, in reflection, 366. 

— linear, in reflection, 366. 

— magnetic, 460. 

— vector, 460. 

Polhode, 141, 142. 

Polygon of forces, 145. 

Ponderomotive forces, 285. 

Porous-plug experiment, 609. 

Positive rays, 440. 

Positron, 762, 764, 767. 

Potential, 87, 88. 

— barriers, 721. 

— elastic, 176, 176. 

— energy, 113, 114. 

— excitation, 432, 649. 

— flow, 198. 

Gibbs, 626, 627, 632 n., 641. 

— Tetarded, 340. 

— thermodynamic, 624, 526. 

Power factor, 319. 

— of a lens, 403. 


Poynting veotor, 331, 332, 366. 
Precession, 167. 

— cone of, 168. 

— Larmor, 664. 

— of axis of earth, 168. 

Pressure, radiation, 613. 

Primary cosmic rays, 780. 

Principal extensions, 167. 

— normal, 19. 

— plane, 399. 

— point, 399. 

— rays, 786. 

— series, 660. 

— stress, 169. 

Principle of correspondence, 666. 

— of uncertainty, 628 n. 

Prism, Nicol, 374, 376. 

— resolving power of, 866, Ex. 108. 

— spectrograph, 409, Ex. 106, 866. 
Product of inertia, 149. 

Projectiles, deviation of, 839, Ex. 53. 
Propagation in anisotropic media, 360. 
Proper functions, 186. 

— time, 266. 

— values, 186. 

of wave equation, 698. 

Proton, 646 767. 

Pupil, entrance, 786. 

— exit, 786. 


Quantum, light, 432. 

— of action, Planck’s, 432. 

— statistics, classical, 603. 
Quantum-mechanical square ”, 683. 

Quartz vibrator, 797. 

Quaai-elastio forces, 93. 


Rabi, 772. 

Radiation pressure, 613. 

Radioactivity, 746, 

— artificial, 766. 

— induced, 766. 

Radius of gyration, 148 n. 

Raman effect, 689. 

in wave mechanics, 716. 

Raoult’s law, 646. 

Ray axes, 371. 

— surface, 369, 370. 

Rayleigh waves, 182. 

Reactanoe, 317. 

Rectifier, 434. 

Reference frame, uniformly rotating, 232. 
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Reflecting plane, 387. 

— power, 364. 

Reflection, optical law of, 349. 

Refraction and anisotropic medium, 371. 

— electron theory of index of, '460. 

— external conical, 373. 

— index of, 331. 

— internal conical, 373. 

— molecular, 466. 

— optical index of, 461. 

— optical law of, 349. 

Refraotivity, 466. 

Relative aperture, 866, Ex. 107. 
Relativistio mechanics, 228. 

Relativity, generalized theory of, 268. 
Reluotanoe, magnetio, 308. 

Residue, 74. 

Resistance, 293. 

Resistivity, 294. 

Resolution, limit of, 410. 

Resolving power, 384, 409. 

of prism, 866. 

Resonanoe, 96, 99, 319. 

— curve, 99, 102. 

— energies, 764, 762. 

Retarded potential, 340. 

Reversible process. 603. 

Revolving electrons, 463. 

Reynolds’s number, 216. 
Richardson-Eirucein-de Haas effect, 464. 
Right-handed elliptic motion, 69. 

“ Rita term ”, 669. 

Rdntgen and Eichenwald, experiments 
of, 476. 

Rotating earth, free fall on, 231. 

Rotation of a vector, 27 n. 
Rotation-vibration bands, 682. 

Rowland’s experiment, 474. 

Rutherford, 747. 

Rydberg number, 661. 

“ Rydberg terra ”, 669. 

Sagnao’s experiment, 478. 

Sargent’s formula, 747. 

Saturation, 461. 

— current, 438. 

Scalar, 7. 

— product, 10. 

Scattering power, 719. 

Schmidt limits, 774. 

Schmidt-Sohaler ourves, 774. 

Sohrddinger, 706. — 

— equation, 696, 722, 726, 727, 774. 
Scintillation method, 740. 


Scleronomous systems, 114. 

Sorew motion, 142. 

Second law of thermodynamics, 614. 
Secondary cosmic rays, 780. 
Self-induotanoe, 314. 

Self-induction, 313, 314. 

— coefficient of, 314. 

Shadow, geometric, 382. 

Sharp series, 660. 

Shear modulus, 173. 

Shell integral, 14. 

Signal velocity, 66. 

Simple harmonio vibration, 60. 

— pendulum, 103. 

Simultaneity, 242. 

Sine condition, 786. 

Single-ray direction, 373. 

Singular point, 73. 

Skin effect, 324. 

Smoke rings, 206. 

SneU’s law, 360. 

Soap bubble, 227. 

Solenoidal field, 289. 

Solid state, 607. 

Sound waves, 212. 

Space ourves, 17. 

— relativistio conception of, 240. 
Spark discharge, 433, 436. 

— spectra, 674 n. 

Specific heat, 603, 629, 696, 606. 

of solids, Debye’s theory, 608. 

rotational, 861, Ex. 129. 

— inductive capacity, 267 n., 276. 

— resistance, 294. 

Spectra, arc, 674. 

— rotation, 702. 

Spectral lines, 719. 

intensity relations of, 703. 

Spectrograph, prism, 409, 855. 
Spectrum of alkalis, 667. 

— of hydrogen, 662. 

— of ionized helium, 662. 

— order of, 384. 

— X-ray, 653. 

“ Sphere of influence ”, 566. 
Spherioal condenser, 274. 

— harmonios, 699, 819. 

relations between, 820. 

Spin of electron, 689. 

Spontaneous magnetization, 462. 
Standing waves, 67. 

Stark effect, 476, 662. 

Starting vortex, 2:1. 
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Statistics, Fermi-Dirac, 625, 631. 

— quantum, classical, 603. 
Stofan-Boltzmann law, 624. 

Steiner's theorem, 148. 

Stern and Gerlach, experiments of, 665. 
“ Stokes ” line, 721. 

Stokes resistance, 418, 419, 425. 

Stokes’s law, 218. 

of fluorescence, 721. 

— theorem, 25, 72, 203* 204. 

Strain, 162. 

— ellipsoid, 171. 

— tensor, 171. 

Stream function, 200. 

— lines, 194, 201. 

Stress, 167. 

— tensors, 171. 

Strings, transverse vibration of, 183. 
Subordinate series, first, 660. 

second, 660. 

Superconductivity, 466-70, 635. 
Superposition of plane waves, 68. 

Surface brightness, 614. 

— curl, 345. 

— invariant, 404. 

— tension, 222, 223. 

— waves, 182. 

— zonal harmonics, 699, 820. 

Surfaces of discontinuity, 43. 
Susceptibility, 280, 450. 453. 

— diamagnetic, 463. 

— ferromagnetic, 458. 

— paramagnetic, 458, 669. 

— parelectrio, 280, 455. 

Swinne formula, 747. 

Symbols in nuclear physics, 748 n. 
Systems of particles, 108. 

Tangent vector, 18. 

Tangential acceleration, 82. 

Taylor’s series, 76. 

Temperature, critical, 499. 

— scale, thermodynamic, 518. 

Tensor analysis, fundamental principles 
of, 32. 

— antisymmetric, 36. 

— ellipsoid, 37. 

— invariants of, 1 72. 

— symmetric, 36. 

Thermal radiation, 613. 

Thermionic tube, characteristic curve, 
799. 

Thermochemistry, 511. 

Thermodynamic equilibrium, 547. 


Thermodynamic potential, 624, 526. 
Thermodynamics, first law of, 501. 

— second law of, 514. 

— third law of, 553. 

Thomson’s theorem, 204, 205, 211. 
Timbre, 187. 

Time, relativistic conception of, 240. 
Top, spherical, 152. 

— symmetrical, 153. 

— theory of the, 161. 

Torricelli’s theorem, 841, Ex. 60. 
Torsion, 20. 

Total heat, 509. 

— reflection, 354. 

Transport numbers, 420. 

Transverse mass, 476. 

— waves, 64. 

Tronton and Noble, experiment of, 475. 
“ Tunnel effect ”, 747. 

Turbulence, 215, 21S. 

Turbulent motion, 198 n. 

Unbounded elastic media, waves in, 180. 
Uncertainty principle, 628 n. 
Undulatory propagation, 212. 

Unital stress, 167. 

Unit plane, 399. 

— vector, 9. 

Units, O.G.S., 266, 330. 

266, 330. 

Uranium 1, 721. 

Valve, electrical, 434. 

Van der Waals’ equation, 498. 

Van’t Hoff’s law, 547. 

Vapour pressure curve, 634. 

Variation, 78. 

Variations, calculus of, 75. 

Vector, components of, 8. 

, — concept of, 7. 

— diagram, 52. 

— differentiation of, 17. 

— gradient, 36. 

— multiplication by scalar, 8. 

— polarization, 45P. 

— potential, 302. 

— product, 12. 

Vectors, addition and subtraction of, 8. 

— equality of, 8. 

— polar, 15. 

Velocity, ray, 364. 

— resonance, 99 n. 

— vector, 81. 
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Vibration, plane of, 351. 

Virtual displacement, 114. 

— displacements, principle of, 113. 
Viscosity, 568, 571. 

— coefficient of, 214. 

Viscous fluids, 214. 

Vortex filament, 205, 206. 

— lines, 205. 

— motion, 203. 

— ring, 206. 

— strength, 206, 206. 

— tube, 206, 206. 

Wattless current, 319. N 
Wave equation, 328. 

proper values, 698. 

— normal, 371. 

— surface, 368, 369, 370. 

— velocity, 61 n., 245. 
Wave-length 62. 

“ Wave-length, De Broglie ”, 695. 
Waves, 61. 


Waves, elliptically polarized, 64. 

— linearly polarized, 64. 

W'eiss law, 629. 

Wiedemann and Franz, law of, 447, 449. 
Wien’s displacement law, 622. 

— experiment, 474. 

Wilson’s cloud chamber, 642. 

Wireless telegraphy, 341 . 

Work, 86. 

Work of escape, 431. 

World-tensor, antisymmetric, 261. 

World vectors, 247, 250. 

Young's modulus, 171. 

Yukawa's theory, 778. 


Zeeman effect, 663, 671, 742. 

anomalous, 667. 

longitudinal, 863, Ex. 133. 

Zone lens. 394. 

— plate, 390. 



